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CDS 



AF232006 
Pseudomonas 
(hrpW) , and 
AF232006 
AF232006. 1 



20052 bp DNA linear BCT 23-MAY-2000 

syringae pv. tomato strain DC3000 AvrE (avrE) , HrpW 
GstA (gstA) genes, complete cds; and unknown genes. 

GI:8037790 



Pseudomonas syringae pv. tomato 
Pseudomonas syringae pv. tomato 

Bacteria; Proteobacteria; gamma subdivision; Pseudomonadaceae; 
Pseudomonas . 

1 (bases 1 to 20052) 

Charkowski, A. O. , Alfano,J.R., Preston,G., Yuan, J., He,S.Y. and 
Collmer, A. 

The Pseudomonas syringae pv. tomato HrpW protein has domains 
similar to harpins and pectate lyases and can elicit the plant 
hypersensitive response and bind to pectate 
J. Bacteriol. 180 (19), 5211-5217 (1998) 
98422476 

2 (bases 1 to 20052) 

Alfano,J.R., Charkowski, A. O . , Deng,W.L., Badel,J.L., 
Petnicki-Ocwie ja, T . , van Dijk,K. and Collmer, A. 

The Pseudomonas syringae Hrp pathogenicity island has a tripartite 
mosaic structure composed of a cluster of type III secretion genes 
bounded by exchangeable effector and conserved effector loci 
contribute to parasitic fitness and pathogenicity in pi 



Proc. Natl 

20243785 

10781092 



Acad. Sci. U.S.A. 97 (9), 4856-4861 




3 (bases 1 
Alf ano, J. R. 



to 20052) 
and Collmer , A. 
Direct Submission 
Submitted ( 07-FEB-2000 ) Dept. 
Parkway, Las Vegas, NV 89154, 



Sci., UNLV, 1854 Maryland 



tomato 



Biol . 
USA 

[WARNING] On Mar 14, 2001 this sequence was replaced by a newer 
version qi : 13325077 . 

Location/Qualifiers 

1. .20052 

/organism="Pseudomonas syringae pv 
/strain="DC3000" 
/db_xref="taxon: 323" 

/note= M conserved effector locus flanking the hrp/hrc 
cluster 

pathovar : tomato" 
complement (1082 . .2569) 
/note="ORFl" 
/ codon_start=l 
/transl table=ll 
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/product= " unknown" 
/protein id=" AAF71498 . 1 " 
/db_xref ="GI : 8037791" 

/translation= " MPAVAFPVS S PRLLARAVQI AVLAMGALCVGCQS VDY S PPRQDR 
PPRLVSGLVYWGGDDKEGSRAADKISDLPVYNGEDVWQRVAQRCRLVDGQGMNERIAR 
QRDWLLSNRGFITGASVRASPYLHFIVERLDERNMPLELALLPMIESSYNPMANSPAA 
AAGLWQFIPSTGRSFNLHQSATYDARRDWASSKAAMDYLTRLHDQFNNDWLLALAAY 
NAGEGTVGRAIE7\NRRRGLPVDYWNLNLPKETQDYVPRLLALSMWRNPNAYGVKLTP 
VANT P Y FDWE LNHAVD LT Q LAAT AGV D E GQLLRLNS AFLRKKT FDGPGRLL I PKTQN 
RVLTTSIARITGESPVTASIPLVTPSAIVKRAAERLPAPAPVAERPVRVAERPPLPKP 
VQSAQ LAENL P S PKTAQ PL PVAQQ PT APE ALQ PAPAAE S APAVE I AQ SAP WE PD PLT 
NRIQLPRIRDDRSSPRKRDDDEYRSGPRELPTGPRVWFASDPRQ" 

gene 2790. . 8178 

/ gene="avrE" 

CDS 2790.. 8177 

/gene="avrE lf 
/codon_start=l 
/transl table=ll 
/product="AvrE " 
/protein id=" AAF71499 . 1 " 
/db_xref="GI : 8037792" 

/ translation="MQSPSIHRNTGSIIQPTVTPDARAATDLQERAEQPRQRSSHSLS 
SVGKRALKSVGKLFQKSKAPQQKAATPPTAKNVKTPPPASNVATPRNKARESGFSNSS 
PQNTHRAPKWI LRNH PNQASS S GAQTHE I H PEAAPRKNLRVR FDLPQDRLERS PS Y LD 
SDNPMTDEEAVANATRQFRSPDSHLQGSDGTRISMLATDPDQPSSSGSKIGDSDGPIP 
PRE PMLWRSNGGR FE LKDEKLVRNSE PQGS IQLDAKGKPDFST FNT PGLAPLLDS I LA 
TPKQTYLAHQSKDGVHGHQLLQANGHFLHLAQDDSSLAVIRSSNEALLIEGKKPPAVK 
MEREDGNIHIDTASGRKTQELPGKAHIAHIT^AALLSHDGERMRVHEDRLYQFDPISTR 
WKI PEGLEDTAFNSLSTGGNGSVYAKSDDAWDLSSPFMPHVEVEDLQSFSVAPDNRA 
ALLSGKTTQAILLTDMS PVIGGLTPKKTKGLELDGGKAQAAAVGLSGDKLFIADTQGR 
LYSADRSAFEGDDPKLKLMPEQANFQLEGVPLGGHNRVTGFINGDDGGVHALIKNRQG 
ETHSHALDEQSSKLQSGWNLTNALVLNNNRGLTMPPPPTAADRLNLDRAGLVGLSEGR 
IQRWDATPECWKDAGIKDIDRLQRGADSNAYVLKGGKLHALKIAAEHPNMAFDRNTAL 
AQTARSTKVEMGKEIEGLDDRVIKAFAMVSNKRFVALDDQNKLTAHSKDHKPVTLDIP 
GLEGDIKSLSLDEKHNLHALTSTGGLYCLPKEAWQSTKLGDQLRARWTPVALPGGQPV 
KALFTNDDWLSAQIEDAEGKGLMQLKAGQWQRFEQRPVEENGLNDVHSRITGSNKTW 
RI PKTGLT LRMDVNT F GR S GVE KS KKAS TSEFI RAN I Y KNTAE T PRWMKNVGDH IQHR 
YQGRLGLKEVYETESMLFKQLELIHESGGRPPARGQDLKARITALEAKLGPQGATLVK 
ELETLRDELENHSYTALMSIGQSYGKAKNLKQQDGILNQHGELAKPSVRMQFGKKLAD 
LGTKLNFKSSGHDLVKELQDALTQVAPSAENPTKKLLGTLKHQGLKLSHQKADIPLGQ 
RRDASEDHGLSK7\RLALDLVTLKSLGALLDQVEQLPPQSDIEPLQKKLATLRDVTYGE 
NPVKVVTDMGFTDNKALESGYESVKTFLKSFKK7UDHAVSV^^4RAATGSKDQAELAGKF 
KSMLKQLEHGDDEVGLQRSYGVNLTTPFIILADKATGLWPTAGATGNRNYILNAERCE 
GGVTLYLISEGAGNVSGGFGAGKDYWPGFFDANNPARSVDVGNNRTLT PNFRLGVDVT 
ATVAASQRAGWFNVPDEDIDAFVDDLFEGQLNPLQVLKKAVDHESYEARRFNFDLTA 
GGTADIRAGINLTEDRDPNADPNSDSFSAWRGGFAANITVNLMTYTDYSLTQKNDKT 
ELKE GGKNR PR F LNNVT AGGQ LRAQ I GGS HT APT GT PASAPGPT PAS QT AANNLGGAL 
NFSVENRTVKRIKFRYNVAKPITTEGLSKLSKGLGEAFLDNTTKAKLAELADPLNARY 
TGKKPDEVIQAQLDGLEELFADIPPPKDNDKQYKALRDLKRAAVEHRASANKHSVMDN 
AR FETS KTNLS GLS SE S I LTKIMS S VRDAS APGNATRVAE FMRQDPKLRAMLKEME GS 
IGTLARVRLEPKDSLVDKIDEGSLNGT^f^QSDLSSMLEDRNEMRIKRLWFHTATQAE 
NFTSPTPLVSYNSGANVSVTKTLGRINFVYGADQDKPIGYTFDGELSRPSASLKEAAG 
DLKKEGFELKS " 

CDS complement (8226. . 8621) 

/note="ORF2" 
/codon_start=l 
/transl table=ll 
/product= 11 unknown" 
/protein id=" AAF7150Q . 1 " 
/db xref="GI: 8037793" 
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/translation="MTMKTSQPDFARFINSLGAQLGTSLTLQNGVCALYDGQNNEAAI 
IELPEHSEMVIFHCRIGRCPERAPDLLRLLSLNFDVARLHGCWFAVDQGDVRLCAQRE 
LAS LDE PAFCDVTRGFI SQAREARAFLQA" 
CDS complement (8960. .10831) 

/note="ORF3 M 
/codon_start=l 
/transl table=ll 
/product="unknown" 
/protein id=" AAF71501 . 1 " 
/db_xref ="GI : 8037794" 

/translation="MISSRIGGAGGVKLSRVNQQHDTVPAQTAHPNAVTAGMNPPLTP 
DQSGSHATESSSAGAARLNVAARHTQLLQAFKAEHGTAPVSGAPMISSRAALLIGSLL 
QAEPLPFEVMAEKLSPERYQLKQFQGSDLQQRLEKFAQPGQIPDKAEVGQLIKGFAQS 
VADQLEHFQLMHDASPATVGQHAKADKATLAVSQTALGEYAGRASKAIGEGLSNSIAS 
LDE H I S ALDLT LQDAE QGNKE S LHADRQALVDAKTT LVGLHAD FVKS PE AKRLAS VAA 
HTQLDNWSDLVTARNTVGGWKGAGPIVAAAVPQFLSS^4THLGYVRLSTSDKLRDTIP 
ETSSDANMLK7\SIIGMVAGIAHETWSWKPMFQAALQKTGLNERLNMVPMKAV 

VIPDPFELKSEHGE LVKKT PE E VAQDKAFVKS E RALLNQKKVQGS STHPVGE LMAY S A 
FGGSQAVRQMLNDVHQINGQTLSARALASGFGGAVSASSQTLLQLKSNYVDPQGRKIP 
VFTPDRAESDLKKDLLKGMDLREPSVRTTFYSKALSGIQSSALTSALPPVTAQAEGAS 
GTLSAGAILRNMALAATGSVSYLSTLYTNQSVTAEAKALKAAGMGGATPMLDRTETL" 
CDS complement (10828 . .11322) 

/note="ORF4" 
/codon_start=l 
/transl table=ll 
/product= "unknown" 
/protein id=" AAF71502 . 1 " 
/db_xref="GI : 8037795" 

/ translation="MTNNDQYHTLINEICALSLISTPERFYESANFKISEVDFTLQFQ 
DRDE GRAVL I Y GDMGAL PARGRE S ALLALMD I N FHMFAGAHS PAFS FNAQT GRVLLMG 
SVALERASAEGVLLLMKSFSDLAKEWREHGFMGQATTAGSSTDQPVAPAT^RESLSAP 
GRFQ" 

gene 11628. .12902 

/gene="hrpW" 
CDS 11628. .12902 

/gene="hrpW" 

/codon_start=l 

/transl table=ll 

/product="HrpW" 

/protein id=" AAF71503 . 1 " 

/db_xref ="GI : 8037796" 

/translation="MSIGITPRPQQTTTPLDFSALSGKSPQPNTFGEQNTQQAIDPSA 
LLFGSDTQKDWFGTPDSTVQNPQDASKPNDSQSNIAKLISALIMSLLQMLTNSNKKQ 
DTNQEQPDSQAPFQNNGGLGTPSADSGGGGTPDATGGGGGDTPSATGGGGGDTPTATG 
GGGS GGGGT PT AT GGGS GGT PT AT GGGE GGVT PQ I T PQ LANPNRT S GT GS VS DTAGS T 
EQAGKI^^\A/ ^ KDTIKVGAGEVFDGHGATFTADKSMGNGDQGENQKPMFEL7\EGATLKNV 
NLGENE VDGI HVKAKNAQE VT I DNVHAQNVGE DL I TVKGE GGAAVTNLNI KNS SAKGA 
DDKWQLNANTHLKIDNFKADDFGTMVRTNGGKQFDDMSIELNGIEANHGKFALVKSD 
SDDLKLATGNIAMTDVKHAYDKTQASTQHTEL" 
CDS complement (12986. .14446) 

/note="ORF5" 
/ codon_start=l 
/ transl table=ll 
/product= M unknown" 
/protein id=" AAF7 1504 . 1 " 
/db_xref="GI: 8037797" 

/translation="MHINRRVQQPPVTATDSFRTASDASLASSSVRSVSSDQQREINA 
IADYLTDHVFAAHKLPPADSADGQAAVDVHNAQITALIETRASRLHFEGETPATIADT 
FAK7\EKLDRLATTTSGALRATPFAMASLLQYMQPAINKGDWLPAPLKPLTPLISGALS 
GAMDQVGTKMMDRATGDLHYLSASPDRLHDAMAASVKRHSPSLARQVLDTGVAVQTYS 
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ARN AVRT V LAP ALAS R P AVQ GAVDL GV SMAGGLAAN AG F GN R L L S VQ S RDH QR GGAL V 
LGLKDKE PKAQLSEENDWLEAYKAI KSAS YSGAALNAGKRMAGLPLDMATDAMGAVRS 
LVS AS S LTQNGLALAGG FAGVGKLQEMAT KN I T D PAT KAAVS QLTNLAGS AAV FAGWT 
TAALTTDPAVKKAE S FI QDTVKSTASSTTGY VADQTVKLAKTVKDMGGEAI THTGAS L 
RNTVNNLRQRPAREADIEEGGTAASPSEIPFRPMRS " 

CDS 14818.. 15891 

/note="ORF6" 
/codon_start=l 
/transl table=ll 
/product= "unknown" 
/protein id=" AAF71505 . 1 " 
/db_xref="GI : 8037798" 

/translation="MSGPFEKKWRCFTRTVTYVGWSLFWLLLWDVAVTVDVMLIEGKG 
IDFPLMPLTLLCSALIVLISFRNSSAYNRWWEARTLWGAMVNTSRSFGRQVLTLIDGE 
RDDLNNPVKAILFQRHVAYLRALRAHLKGDVKTAKLDGLLSPDEIQRASQSNNFPNDI 
LNGSAAVISQAFAAGQFDSIRLTRLESTMVDLSNCQGGMERIANTPLPYPYVYFPRLF 
STLFCILMPLSMVTTLGWFTPAISTAA/'GCMLLAMDRIGTDLQAPFGNSQHRIRMEDLC 
NTIEKNLQSMFSSPERQPLLADLKS PVPWRVANASIGGLSRQKNRLGEGARLIASESL 
LWAPFRSVADVAPCHASAYLRRA" 
CDS complement {15800. . 16852) 

/note="ORF7" 
/codon_start=l 
/transl_table=ll 
/product=" unknown" 
/protein id=" AAF71506 . 1 " 
/db_xref="GI : 8037799" 

/translation="MYIQQSGAQSGVAAKTQHDKPSSLSGLAPGSSDAFARFHPEKAG 
AFVPLEGHEEVFFDARSSFSSVDAADLPSPEQVQPQLHSLRTLLPDLMVSIASLRDGA 
TQYIKTRIKAMADNSIGATANIEAKRKIAQEHGCQLVHPFHQSKFLFEKTIDDRAFAA 
D Y GRAGGDGHACLGLS VNWCQS RAKGQS DE AF FHKLE D YQGDALL PRVMGFQH I E QQA 
YSNKLQNAAPMLLDTLPKLGMTLGKGLGRAQHAHYAVALENLDRDLKAVLQPGKDQML 
LFLSDSHAMALHQDSQGCLHFFDPLFGWQADSFSNMSHFLADVFKRDVGTHWRGTEQ 
* RLQLSEMVPRADFHLR" 
CDS complement (16866. .17345) 

/note="ORF8" 
/codon_start=l 
/ transl table=ll 
/product= "unknown" 
/protein id= M AAF71507 . 1 " 
/db_xref="GI : 8037800" 

/translation="MRPVEAKDRLYQWLRNRGIDAQEGQRHNVRTANGSECLLWLPEQ 
DTSLFIFTQIERLTMPQDNVILILAMALNLEPARTGGAALGYNPDSRELLLRSVHSMA 
DLDETGLDHLMTRISTLAVSLQRYLEDYRRQEQAGKTAQKEPRFLPAVHLTPRTFMT" 
CDS 17562.. 17849 

/note="ORF9" 
/codon_start=l 
/ transl_table=ll 
/product=" unknown" 
/protein id=" AAF71508 . 1 " 
/db_xref="GI : 8037801" 

/ translation="MLKKCLLLVISMSLGGCWSLMIHLDGERCIYPGTRQGWAWGTHN 
GGQSWPILIDVPFSLALDTLLLPYDLTAFLPENLGGDDRKCQFSGGLNVLG" 

CDS complement (17 912 . . 18358) 

/note="ORF10" 
/ codon_start=l 
/ transl_table=ll 
/product= "unknown" 
/protein id=" AAF71509 . 1 " 
/db_xref="GI : 8037802" 

/translation="MKQVEVQIITELPCQVLILEQEAVAEGFRFLTRLIEEWRSGKNR 
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gene 



CDS 



CDS 



FEAKGECLMWLLDGJVLAGIGGLSRDPHARGDMGRLRRLYVASASRGQGLGKTLVNRL 
VEHAAQEFFAVRLFTDTPSGAKFYLRCGFQAVDEVHATHIKLLRRV" 

complement (18766. .19392) 
/ gene="gstA" 
complement (187 66. . 19392) 
/ gene="gstA" 
/codon_start=l 
/transl_table=ll 
/product^"GstA" 
/protein id=" AAF71510 . 1 " 
/db_xref="GI : 8037803" 

/translation="MIIVHHLNNSRSQRILWLLEELGLPYEIKRYQRDPKTNLAPPEL 

KAIHPLGKSPVIEDGPHWIESGAIVDYLIRRHGDGRLQPDPASATYDEYVQWLHFAE 

GSAILPLMLNLYVGRLGDAGAPLHPRIHSELANYLGYLNEALGLTPYLLGDELSGADI 

QMS F I GE FAKMQ GMLQ PH PN LAAWVQR CQE R PAY RQALE QGGE Y S FAK " 

complement (19482. .19898) 

/note="ORFll" 

/ codon_start=l 

/transl_table=ll 

/product="unknown" 

/protein id=" AAF71511. 1 " 

/db_xref="GI : 8037804" 

/translation="MNLTDHYRAYIACLNDRSWHQLGNFVDDNVHYNGELISLSGYKT 
MLENDVRIIPDLRFKIDLLWEAPQVAARLVFNCSPQGRFLGQDTQGRRITFSENVFY 
T FSDGKI I GVWS VI DKAAVERQLRGE S AALCADRRV " 



BASE COUNT 
ORIGIN 

1 
61 

181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 



4782 a 5813 c 5536 g 3921 t 



cgttgcactc 
atatattaat 
atattaatat 
ctttaatcta 
cccgcaagcc 
cacctttgag 
tactcgagag 
cagaagaaaa 
gaagcgaaca 
tagaacagtc 
cacagaggct 
ctggcattta 
ttccaggcac 
cgcaggttta 
tagaatgttt 
ataatcatat 
gggtaacgct 
aaagcaccaa 
gttattggcg 
gaccgctgcg 
ggggcaactg 
tctctaccgc 
gctgctgagc 
gggccgactg 
ccacaggcgc 
ccagcgggat 
gcacgcgatt 
gcaggaatgc 
gctgggtcag 
ccggggtcag 
gtctgggcac 
gaccacgccg 
cggccagcgc 



tcatggtggg 
ttagttcccc 
caccatacca 
tcgttactcc 
atagggcctc 
cagcaagcgc 
aaacatataa 
ccgaacacac 
acaataacac 
gataccaacc 
tggatactgg 
gtcatcatcg 
agccatctaa 
ggttctgtga 
ctaaatgtgt 
gtagcgctct 
caataaaaga 
aaaagtgcgc 
tggatcactg 
atactcgtcg 
tatccggttt 
cggcgctgat 
caccggcaac 
cacgggcttg 
gggcgcgggc 
gctggcagtc 
ctgggtttta 
cgagttcagc 
gtctacggca 
tttcacaccg 
gtaatcctgg 
tcggttggcc 
cagcagccag 



tggcaagcgg 
gggaaatgag 
agacgaccct 
aatgcgaaca 
tccacacctc 
cccaaaatcg 
gacttttcca 
aaaacaagaa 
gagaaaacaa 
agcttagttc 
caaagcggtc 
cattcggcaa 
ggaatcgcgg 
accaggcggt 
gtaatctttc 
acatcatatg 
agttgtattg 
ttttcagggg 
gcaaaaacca 
tcatcacgct 
gtaagcggat 
tcagctgcag 
ggctgagccg 
ggcagcggcg 
agacgctcag 
accggggact 
ggtatcagca 
cgcaacaact 
tggttaagct 
taggcattgg 
gtttccttgg 
tcaatcgccc 
tcattattga 



agtattaata 
aaaaagatca 
accgatagac 
agcgcttaca 
aaagcagctg 
cgcaatgaaa 
aaacaactaa 
aaccaaacgg 
aaaacagcct 
cgccccacga 
atagccccgg 
tccgaacaaa 
aaaggattca 
taatacgatc 
acttacattc 
gctaagcgcc 
aggcagatca 
ttttcaatag 
cgacgcgcgg 
tgcgaggcga 
caggttccac 
gagctggctg 
ttttgggcga 
gacgctctgc 
ccgcccgttt 
caccggtaat 
gacgtcccgg 
ggccctcatc 
cgactacgtc 
ggttgcgcac 
gtaaattcag 
gaccgacggt 
actgatcatg 



ccacgtctta 
caaagttgag 
tcaggctctt 
gcgtccatgc 
tgatccggga 
cgcaactaac 
aggggtcaca 
tttttagcgg 
gacactaact 
gcagtcggat 
tttttcggca 
agcccacctg 
gcgtagctta 
gatgatcgcg 
ggctaaaaaa 
atctttaggg 
atattgtccg 
aacaatcgag 
ccccgtaggc 
cgaacggtca 
aacaggtgcg 
taacgcctca 
aggcaggttc 
aacgcgcact 
cacaatggct 
gcgcgcgatg 
tccatcgaag 
cacacccgcc 
aaaatacggc 
aaccattgag 
attccagtag 
gccctccccc 
caagcgggtc 



caaggcagaa 
aattactatc 
gagatgattg 
gctggctcgc 
caagagcagg 
ttctcgtcac 
agtaaggaag 
cgagcttaaa 
atttgcactt 
ttccgaacaa 
ccactcagta 
cttagactat 
ataccggaac 
tgccatcacc 
gttcatcaaa 
tccaaaaaac 
acaacgagaa 
taaaaccggg 
agctcgcgcg 
tccctgatgc 
gattgggcga 
ggcgcagtgg 
tcggctaact 
ggacgctcag 
gaaggggtga 
ctggtcgtga 
gtctttttgc 
gtggccgcga 
gtgttggcga 
agcgccaaca 
tccacaggca 
gcgttatagg 
aggtaatcca 
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1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 
4201 
4261 
4321 
4381 
4441 
4501 
4561 
4621 
4681 
4741 
4801 
4861 
4921 
4981 
5041 

5101 
5161 
5221 
5281 
5341 
5401 
5461 
5521 



tcgccgcctt 
gattgaagct 
agttggccat 
tgttgcgctc 
ctcccgtgat 
tcatgccttg 
cgttataaac 
ccccccaata 
agtccacaga 
cgcgagccag 
tcctgagcgt 
atcggcgcac 
ctggcattgc 
cagcgataaa 
gaagcatcat 
gagccgaaca 
tgaaaagcgt 
cgcccaccgc 
acaaagcccg 
agtggattct 
tacacccgga 
gccttgagcg 
tcgcaaatgc 
cgcgcatttc 
gtgattcgga 
gtttcgagct 
tggatgccaa 
tcgattccat 
tgcacgggca 
gctcgctggc 
cggccgtgaa 
aaacccaaga 
acgacggcga 
ctcgctggaa 
gcaacggctc 
tgccgcacgt 
cgttgctcag 
gcgggctgac 
cggcggtcgg 
gtgcggaccg 
caaactttca 
acggggacga 
acgctttaga 
tactgaacaa 
acctcgatcg 
cgccagaatg 
acagcaatgc 
accccaacat 
tcgaaatggg 
tcagcaacaa 
atcacaaacc 
tggacgaaaa 

aggaagcctg 
cgctgcccgg 
cccagatcga 
ggttcgaaca 
gttcaaacaa 
cattcgggcg 
ccaacatcta 
ttcagcatcg 



gctggaggcc 
gcgccccgtg 
ggggttataa 
gtccaggcgc 
aaatccgcga 
gccatcgacc 
aggcagatcg 
gaccagcccc 
ctggcagccc 
caagcgtggg 
gtccacccta 
acaacgcatt 
atgccactca 
atggacagag 
tcagccaacc 
acccaggcaa 
cggtaaattg 
gaaaaacgtc 
cgaatccggt 
gcgtaaccac 
ggcagccccc 
cagcccgtcg 
cactcgccaa 
aatgctggcc 
cggaccgatt 
gaaagacgaa 
gggaaagcct 
tcttgccaca 
ccagttgcta 
cgtgatccgt 
aatggagcgt 
gctcccaggc 
gcgtatgcgt 
aataccggaa 
ggtttatgca 
ggaagtcgaa 
cggcaaaacg 
gccgaaaaaa 
tttgagtggc 
tagcgcattc 
gctggaaggc 
cggcggtgtt 
cgagcaaagc 
caatcgcggc 
tgcgggcctg 
ctggaaagac 
ttatgtactc 
ggcttttgac 
caaagagatc 
acgcttcgtc 
cgtcacactc 
acacaacctg 
gcaatcgaca 
agggcagccg 
agacgccgag 
gcgcccggta 
gacctggcga 
cagcggtgtg 
caaaaacacc 
ctaccagggt 



accacgtcac 
gatggaatga 
gagctttcga 
tcgacaataa 
ttgctcagca 
agcctgcagc 
gagattttgt 
gacaccagcc 
acacacaagg 
ctcgatacgg 
cgtggcacgc 
gctgaatcct 
tcctgtgaag 
agattcaccg 
gtcacccctg 
cgctcttcgc 
ttccagaaat 
aagacgcccc 
ttttccaaca 
cccaaccagg 
cgtaaaaacc 
tacctcgatt 
ttccggtcac 
acagatcctg 
ccgccgcgcg 
aaactggttc 
gacttctcca 
cccaagcaaa 
caggccaacg 
agcagcaacg 
gaagacggca 
aaggcacaca 
gtgcatgagg 
ggcctggagg 
aaaagtgacg 
gacctgcagt 
acccaggcga 
accaaaggcc 
gacaagctgt 
gagggcgatg 
gtgcccctcg 
cacgcgctga 
tcaaaactgc 
ctgaccatgc 
gttggcctga 
gcaggcataa 
aagggcggca 
cgcaacacag 
gaaggcctcg 
gccctcgatg 
gacattcccg 
cacgccctca 
aagctggggg 
gtaaaggcac 
ggcaagggtc 
gaagaaaacg 
attccaaaaa 
gagaaatcca 
gcagaaacgc 
cgcctgggtc 



ggcgagcgtc 
attgccacaa 
tcatcggcag 
aatgcagata 
accagtcgcg 
gctgggcaac 
ctgcagcccg 
gcggcggacg 
cgcccatagc 
ggaaggcgac 
tcgccgttac 
ttcagccgta 
gaattttcac 
tgcagtcacc 
acgcacgtgc 
actcgttgag 
ccaaagcgcc 
cgcctgcttc 
gcagcccgca 
cgagcagctc 
tgcgcgtaag 
cagacaaccc 
ctgacagtca 
atcagcccag 
agcccatgct 
gcaactcaga 
cgttcaatac 
cctacctggc 
ggcactttct 
aagcactcct 
acattcacat 
tcgctcacat 
accgtctcta 
ataccgcttt 
atgccgtggt 
cattttcagt 
tcctactgac 
ttgagctcga 
ttatcgctga 
acccgaaatt 
gaggccacaa 
tcaaaaaccg 
aaagcggctg 
ccccgccacc 
gtgaaggacg 
aagatatcga 
agctgcacgc 
cactggccca 
acgaccgagt 
accagaacaa 
ggctggaagg 
ccagtaccgg 
accagttgcg 
ttttcaccaa 
ttatgcagct 
gtttgaatga 
ccgggctgac 
aaaaagccag 
cccgctggat 
tgaaagaggt 



gtaggtcgcg 




cagtgccagc 
agggctggcc 
ctggcgagcg 
ccgctgccac 
cgaaccttcc 
gtcctgacgc 
gaggactgcg 
ggcgggcatg 
ggttcccttt 
agtttttccg 
gtttggtatc 
atcgatccac 
tgcaactgac 
cagtgtcggc 
gcagcagaaa 
aaatgtggct 
aaatacccat 
gggcgcgcag 
gtttgatctg 
gatgaccgat 
cctgcagggc 
cagctccggc 
gtggcgcagc 
gccacaaggc 
gcccggcctg 
ccaccaaagc 
gcacctggcg 
tatagaagga 
cgacaccgcc 
taccaatgtg 
tcagttcgac 
caacagcctg 
cgacttgtcg 
cgcgccggac 
tgacatgagc 
cggcggcaag 
cactcagggc 
gaagctgatg 
ccgcgtcacc 
tcagggcgag 
gaacctgacc 
caccgccgct 
cattcaacgc 
tcgcctgcaa 
actcaagatt 
gaccgcacgc 
gatcaaagcc 
gctgaccgcc 
cgatatcaag 
cgggctttac 
agcccgctgg 
cgacgacaac 
caaggcaggc 
tgtgcactcg 
gctcagaatg 
caccagcgag 
gaagaacgta 
ttatgaaacc 



ctttgatgca 
gcggccggag 
tccagcggca 
cggacactgg 
atacgctcat 
acgtcctcgc 
ttatcatctc 
ggcggcgaat 
atttgaacag 
ggcgggaatg 
tgaaaccgag 
atggaacccg 
aggcggctat 
cggaacaccg 
ctgcaggaaa 
aagcgggcgc 
gctgccacgc 
acgcccagaa 
agggcaccca 
acgcatgaaa 
ccgcaagacc 
gaagaagcgg 
tctgacggta 
agcaaaatcg 
aacggaggcc 
agcattcagc 
gctccattgc 
aaagacggcg 
caagacgaca 
aagaaaccac 
agcggccgca 
cttctcagtc 
ccgataagca 
tccactggcg 
agcccgttca 
aacagagcag 
ccggtgattg 
gcgcaggcgg 
agactttaca 
cccgagcagg 
ggattcatca 
actcactccc 
aatgcgctgg 
gaccggctca 
tgggacgcaa 
cgcggcgccg 
gcggccgaac 
tcgacaaaag 
tttgcaatgg 
cacagtaagg 
agcctgtcgc 
tgcctgccca 
acgccggttg 
gtgctcagcg 
caatggcaaa 
cgcatcacag 
gacgtcaata 
ttcatccgcg 
ggtgaccata 
gagtcgatgc 
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5581 
5641 
5701 
5761 
5821 
5881 
5941 
6001 
6061 
6121 
6181 
6241 
6301 
6361 
6421 
6481 
6541 
6601 
6661 
6721 
6781 
6841 
6901 
6961 
7021 
7081 
7141 
7201 
7261 
7321 
7381 
7441 
7501 
7561 
7621 
7681 
7741 
7801 
7861 
7921 
7981 
8041 
8101 
8161 
8221 
8281 
8341 
8401 
8461 
8521 
8581 
8641 
8701 
8761 
8821 
8881 
8941 
9001 
9061 
9121 



tgttcaagca 
acctgaaagc 
tcaaggaact 
cgatcggtca 
agcatggcga 
tgggcacaaa 
ccttgactca 
agcatcaagg 
atgccagcga 
aaagccttgg 
cgttacaaaa 
tggtcacaga 
agacattcct 
ccacaggcag 
tggagcatgg 
cgttcatcat 
accgtaacta 
gcgaaggtgc 
tttttgacgc 
ccaactttcg 
tggtcttcaa 
agttgaatcc 
gattcaactt 
ccgaagaccg 
gcggattcgc 
agaaaaacga 
acgtgacggc 
gcacacccgc 
gcggagcgct 
acgtcgccaa 
cgttcctgga 
gctacacagg 
tgtttgccga 
tgaaacgcgc 
cacgctttga 
aaataatgag 
tcatgcgcca 
cgctggcacg 
gcctcaacgg 
tgcgcatcaa 
caacaccgtt 
gcatcaactt 
aattgtcacg 
tcgaactgaa 
tttattcaag 
acgtcacaaa 
tctccctgat 
aacagacgca 
accatctcac 
agagcgcata 
atgaagcggg 
agcatgaagc 
gggtcaagca 
agcgcctcgc 
tccacggcca 
gcgactcgtg 
ttcggatgac 
ccatgcctgc 
tggacagata 
cactgagcgt 



actggagctg 
gcgcatcacc 
ggaaaccctg 
gagctatggc 
gctggccaag 
gctcaacttc 
agtggctccg 
gctgaaactc 
ggatcatggc 
cgcgctgctc 
aaagctggcg 
catgggcttt 
caagtcgttc 
caaggaccag 
cgacgacgaa 
tcttgccgac 
catactcaat 
gggaaacgtg 
aaataatcct 
cctgggcgtg 
tgttccggat 
attgcaggtg 
cgacctcacg 
agacccgaat 
tgcgaacatc 
caagaccgaa 
cggcgggcag 
ctccgcccca 
caatttcagt 
gccgataacg 
caacacgacc 
caagaaaccg 
cataccaccg 
ggcggtcgag 
aaccagcaaa 
ttccgtgcgc 
ggacccgaaa 
cgtacggctg 
caccatgact 
gcgtctggtg 
ggtcagctat 
cgtttatggc 
accatcggca 
gagctaataa 
cctgtaaaaa 
acgctggctc 
cgaccgcaaa 
gcaaatcggg 
tgtgctcggg 
cgccgttctg 
cgaaatcggg 
atgccggaca 
agccctcttc 
gtgctgacgc 
tctgggtagg 
ggataccgct 
ggatgttatt 
cgctttcaac 
ggacaccgaa 
gccacttgcg 



atccatgagt 
gcactggaag 
cgcgacgagc 
aaggcgaaaa 
ccgtcggtgc 
aaaagctctg 
tctgctgaaa 
agccaccaga 
ctgagcaaag 
gaccaggtcg 
acgctgcgtg 
accgataaca 
aaaaaagcgg 
gccgagctgg 
gtcgggctgc 
aaggctacag 
gccgagcgtt 
agcggcggtt 
gcacgcagtg 
gacgtgaccg 
gaagacatcg 
ctgaaaaaag 
gcaggtggaa 
gccgacccca 
accgttaacc 
ctgaaggaag 
cttcgcgctc 
ggccccactc 
gtggaaaaca 
actgaaggtc 
aaagcaaaac 
gatgaggtta 
cccaaagaca 
catcgggcat 
accaacctct 
gacgcgagcg 
cttcgcgcca 
gaaccgaagg 
caaagcgacc 
gtattccaca 
aacagtggag 
gcagaccagg 
tcgctcaagg 
cgaaaacagt 
agcacgcgct 
atcgagtgag 
ccagcagcca 
ggctcgttcc 
caattcaatg 
caaggtcagt 
ttgcgaagtt 
gcaggcgcct 
aagtgccctc 
gactgcgttc 
caactgcaat 
caacgtgctc 
caaagcgtct 
gccttggctt 
cccgtcgctg 
ccttcagcct 



ccgggggaag 
caaaactggg 
tggaaaatca 
accttaaaca 
gcatgcagtt 
gacatgactt 
accccaccaa 
aagccgacat 
cgcgcctggc 
aacagctacc 
atgtgactta 
aagcgctgga 
accatgccgt 
ccggaaaatt 
agcgcagcta 
ggctctggcc 
gcgagggcgg 
tcggtgccgg 
ttgatgtcgg 
ccaccgtcgc 
acgcattcgt 
cagtggacca 
ctgccgatat 
acagcgattc 
tgatgaccta 
gcggtaaaaa 
agatcggcgg 
ccgcatcaca 
ggacggtcaa 
tgagcaaatt 
tggcggagct 
ttcaggcgca 
acgacaagca 
cagccaacaa 
ccggcctgtc 
ccccgggcaa 
tgctcaagga 
actcactggt 
tctccagcat 
ccgcgaccca 
cgaatgtgag 
acaagccgat 
aagcggctgg 
aaaaaaagcg 
tcacgtgcct 
gccagttcac 
tgcaagcgcg 
gggcagcggc 
atcgccgctt 
gacgtgccga 
ttcatcgtca 
gcagcctgtg 
aatgcgtcat 
aacacacctt 
gcgcctcgtc 
ttctggaacg 
cggtacggtc 
ctgcggtaac 
ccagggccat 
gagcggtcac 



gcctccggca 
gcctcaaggc 
cagctacacc 
gcaggacggc 
tggcaagaag 
ggtcaaggag 
aaagttgctc 
acctttggga 
gctggatctg 
gccgcaaagc 
cggcgaaaac 
aagcggttac 
cagcgtcaat 
caaaagcatg 
cggagtgaac 
aacggcaggt 
cgttacgctg 
caaagactac 
caacaaccgc 
cgccagccag 
cgacgacctg 
tgagagctac 
acgcgccgga 
gttttctgcg 
caccgattat 
ccgcccgcgc 
cagccacacg 
aacagccgcc 
acggatcaag 
gtcgaagggc 
ggccgaccct 
actcgacggg 
gtacaaggca 
gcacagcgtg 
cagtgaaagc 
tgcgacaaga 
gatggagggc 
cgacaagatc 
gctggaggat 
ggctgaaaac 
cgtcactaaa 
tggttacacc 
cgacttgaag 
ccgcattgaa 
gggaaatgaa 
gctgcgcgca 
ctacgtcgaa 
caatgcggca 
cgttgttctg 
gctgggcgcc 
tagtccttta 
tccggcgccg 
cgtcttttgt 
catccacgac 
ccatgtgata 
tatgtggcag 
cagcataggt 
cgactggttg 
gttgcgcaaa 
aggcggcagt 



cggggtcaag 
gctacgctgg 
gcgctgatgt 
attctcaacc 
cttgctgatc 
ctgcaggatg 
ggcacgctga 
cagcgccgcg 
gtcacactga 
gacatagagc 
ccggtcaagg 
gaatcggtca 
atgcgcgcag 
ctcaagcaac 
ctcaccaccc 
gccaccggta 
tacctcatta 
tggccgggct 
acactgaccc 
cgcgccgggg 
tttgaaggtc 
gaggctcggc 
ataaacctga 
gtagtgcgcg 
tcgttgaccc 
tttttgaata 
gcccccacag 
aacaacttgg 
tttcgttaca 
cttggggaag 
ctgaatgcac 
cttgaagaac 
ttgcgcgact 
atggacaacg 
atacttacca 
gttgccgaat 
agtatcggga 
gatgaaggca 
cgcaacgaga 
ttcacctcac 
acactggggc 
ttcgacggcg 
aaagaggggt 
gtggcgcttt 
cccgcgcgtc 
tagacggaca 
gttcagactc 
atgaaagatg 
accgtcataa 
cagagaattg 
aggttaaaac 
ggattaacgc 
cggctgctta 
ccgaaccgta 
ggcgttttcc 
agactccctg 
gttgcaccgc 
gtgtacaacg 
atagcccccg 
gccgaggtca 
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t 



9181 
9241 
9301 
9361 
9421 
9481 
9541 
9601 
9661 
9721 
9781 
9841 
9901 
9961 
10021 
10081 
10141 
10201 
10261 
10321 
10381 
10441 
10501 
10561 
10621 
10681 
10741 
10801 
10861 
10921 
10981 
11041 
11101 
11161 
11221 
11281 
11341 
11401 
11461 
11521 
11581 
11641 
11701 
11761 
11821 
11881 
11941 
12001 
12061 
12121 
12181 
12241 
12301 
12361 
12421 
12481 
12541 
12601 
12661 
12721 



gtgcagaact 
cgcgcaggtc 
taaataccgg 
tttgcgaact 
tctgcccatt 
aggcactgta 
ggttcagcag 
tttttttgac 
tggtatccac 
aggcggcctg 
ccatcccgat 
ctcgcagctt 
actgcggaac 
gggcagtgac 
ggcgcttggc 
cgtcgaccag 
gcagagtgag 
cttcgccgat 
caagcgtcgc 
tcagttgaaa 
cgacctcggc 
agtccgagcc 
cttcaaaagg 
tgatcatcgg 
gctgtgtgtg 
cgtgtgaccc 
ggtgagctgt 
caccggcccc 
gcgtttggct 
catgaatcca 
cacgccttcg 
ctgcgcatta 
catcaacgcc 
gatcagaacg 
gattttgaaa 
gatttcgttg 
ggtgcggacg 
tttcagttcc 
agatgccggc 
ccccaaacat 
ctggagctgg 
tcacaccccg 
gtcctcaacc 
tgttgttcgg 
agaatccgca 
gtgcattgat 
atcaggaaca 
cggccgatag 
caagcgcaac 
gtggcggcgg 
gcggtggcga 
cctcaggtac 
atgtggtgaa 
ccttcactgc 
tcgagctggc 
gcatccacgt 
acgtcggtga 
acatcaagaa 
ctcacttgaa 
gtggcaagca 



ctgaataccc 
catacctttg 
aattttgcgc 
ggcagacacc 
gatctggtga 
agccatcagc 
cgcgcgttca 
cagctcaccg 
agccttcatt 
aaacatcggc 
tatcgaagcc 
gtcgctggtg 
cgcagccgcg 
gagatcgctg 
ctccggcgac 
cgcctgcctg 
atccagcgca 
tgccttgctt 
cttgtccgct 
gtgctccagt 
tttatccggt 
ctgaaactgc 
caaaggctcg 
cgcgccgctg 
tcgagccgcg 
tgactgatcg 
ctgggcggga 
gccgatccgc 
gctggggcaa 
tgctcgcgcc 
gcagaggctc 
aaggaaaatg 
agcaacgcgc 
gcacggcctt 
ttggcagatt 
ataagggtgt 
aataccagtc 
ttgcgttggt 
aaaacgggaa 
ccacatccct 
cgtcggtcca 
gccgcaacag 
aaacacgttc 
cagcgacaca 
ggacgccagc 
catgtcgttg 
gcctgatagc 
cgggggcggc 
aggcggtggc 
cacacccact 
gggtggcgta 
tggctcggtg 
agacaccatc 
cgacaaatct 
tgaaggcgct 
gaaagccaaa 
agacctgatt 
cagcagtgcc 
aatcgacaac 
gtttgatgac 



gaaagagcct 
agcaggtcct 
ccttgcgggt 
gccccgccaa 
acatcgttga 
tcacctaccg 
cttttcacga 
tgttcgcttt 
ggcaccatgt 
ttgaccacgc 
ttgagcatgt 
gacaaacgca 
acaatcggcc 
acgacgttgt 
ttgacgaaat 
tcagcgtgca 
ctgatgtgct 
gcacgaccgg 
tttgcatgct 
tgatcagcga 
atctgacccg 
ttcagttgat 
gcctgcagca 
accggagccg 
acattcagcc 
ggagtcagcg 
acagtatcgt 
gaactgatca 
caggttggtc 
actctttggc 
gttcaagggc 
ccgggctgtg 
tctcacggcc 
cgtcgcggtc 
catagaaacg 
ggtactggtc 
ttcctgctgg 
tgggcatata 
ccggtcgctg 
atcgaacgga 
attgcccact 
accaccacgc 
ggcgagcaga 
cagaaagacg 
aagcccaacg 
ctgcagatgc 
caggctcctt 
ggtacaccgg 
ggcggtgata 
gcaacaggtg 
acaccgcaaa 
tcggacaccg 
aaggtcggcg 
atgggtaacg 
acgttgaaga 
aacgctcagg 
acggtcaaag 
aaaggtgcag 
ttcaaggccg 
atgagcatcg 



tgctgtagaa 
ttttcagatc 
cgacataatt 
aaccggatgc 
gcatctggcg 
gatgggtgga 
acgccttgtc 
tcagctcgaa 
tcaggcgttc 
tgttgaccgt 
tggcgtcgct 
cataacccaa 
ctgcaccttt 
ccagttgcgt 
cggcgtgcaa 
gagactcctt 
catccagcga 
cgtattcgcc 
ggcctaccgt 
ccgactgagc 
gctgggcgaa 
agcgctcagg 
gactaccgat 
tcccatgctc 
gcgccgcgcc 
gcggattcat 
gctgctggtt 
ttggaatctc 
cgtcgaggag 
caggtcggaa 
cacagagccc 
ggcgcccgcg 
gcgcgcgggc 
ctgaaactgc 
ttcaggtgtg 
attgttggtc 
cgtgtgcaca 
aaaaaaggaa 
cgctttgcca 
cagcgatacg 
tagcgaggta 
cactcgattt 
acactcagca 
tcaacttcgg 
acagccagtc 
tcaccaactc 
tccagaacaa 
atgcgacagg 
ctccgaccgc 
gcggcagcgg 
tcactccgca 
caggttctac 
ctggcgaagt 
gagaccaggg 
atgtgaacct 
aagtcaccat 
gcgagggagg 
acgacaaggt 
acgatttcgg 
agctgaacgg 



cgtggtgcgt 
gctctcggcg 
cgacttcaat 
cagagctctt 
cacagcctga 
cgaaccctga 
ctgagcgact 
ggggtcagga 
gttgaggcca 
ctcgtgagca 
gctggtctcg 
gtgtgtcatt 
ccagccaccc 
atgtgcggcg 
acctaccagg 
gttgccctgt 
cgcgatgctg 
aagggcagtc 
tgcgggcgaa 
aaaacccttg 
tttttccagc 
agacaatttc 
caacaacgca 
agccttgaag 
ggcagacgag 
gcctgcagtg 
tacccggctg 
ccaggagccg 
cctgcagttg 
aacgacttca 
atcagcagca 
aacatgtgaa 
aacgcgccca 
agggtgaagt 
gaaatcaggc 
atttcaaggc 
ctgagtcgca 
cttttaaaaa 
ctcacttcga 
gccacttgct 
acgcagcatg 
ttcggcgcta 
agcgatcgac 
cacgcccgac 
caacatcgct 
caataaaaag 
cggcgggctc 
tggcggcggc 
aacaggcggt 
tggcacaccc 
gttggccaac 
cgagcaagcc 
ctttgacggc 
cgaaaatcag 
gggtgagaac 
tgacaacgtg 
cgcagcggtc 
tgtccagctc 
cacgatggtt 
catcgaagct 



accgacggct 
cggtccgggg 
tgcagcagcg 
gcactcagcg 
gaaccaccga 
accttcttct 
tcctcgggcg 
ataaccgtat 
gtcttctgca 
atgcccgcca 
ggaatcgtgt 
gaagacaaga 
accgtgttac 
accgaagcaa 
gtggttttgg 
tcggcatctt 
ttgctcaggc 
tgactgacgg 
gcgtcatgca 
atcagttgcc 
cgctgctgca 
tcggccatga 
gcacgcgaac 
gcctgcaaaa 
ctttctgtcg 
actgcatttg 
agtttgacgc 
aaaggctctc 
tggcctgccc 
tcaacaacag 
cacgaccggt 
agttgatgtc 
tgtcaccgta 
ccacttcgct 
tgagtgcgca 
ctctgagtgc 
ggcataggca 
cagtgcaatg 
gcaagctcaa 
ctggtaaacc 
agcatcggca 
agcggcaaga 
ccgagtgcac 
agcaccgtcc 
aaattgatca 
caggacacca 
ggtacaccgt 
ggtgatacgc 
ggcggcagcg 
actgcaacag 
cctaaccgta 
ggcaagatca 
cacggcgcaa 
aagcccatgt 
gaggtcgatg 
catgcccaga 
actaatctga 
aacgccaaca 
cgcaccaacg 
aaccacggca 
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12781 
12841 
12901 
12961 
13021 
13081 
13141 
13201 
13261 
13321 
13381 
13441 
13501 
13561 
13621 
13681 
13741 
13801 
13861 
13921 
13981 
14041 
14101 
14161 
14221 
14281 
14341 
14401 
14461 
14521 
14581 
14641 
14701 
14761 
14821 
14881 
14941 
15001 
15061 
15121 
15181 
15241 
15301 
15361 
15421 
15481 
15541 
15601 
15661 
15721 
15781 
15841 
15901 
15961 
16021 
16081 
16141 
16201 
16261 
16321 



agttcgccct 
tgaccgacgt 
gaatccagac 
tgtttagcta 
gagaagccgc 
tattgaccgt 
tgacggtctt 
tcgatttcac 
gcgcggccgt 
tgaccgcggc 
ctacccctgc 
ccaggcttct 
gcttgccagc 
cgagccagtc 
ctaatgcacc 
ctgcgttggc 
cgggtctgga 
tctgaaccgc 
ccgaagcggc 
ccgtcgcgcg 
aaatgagcgg 
gctgcatgta 
tagtcgtcgc 
gggtttcccc 
cattgtgtac 
cagcgaacac 
agctgacaga 
ccgtcgcagt 
gttctacgga 
gtgaaagcaa 
gtccgccggt 
ctgaataatc 
ttgcagatta 
atggcttggc 
tctggtcctt 
tcgctgttct 
ggcaaaggca 
ctgatcagct 
ggcgcaatgg 
gaacgggatg 
ttgcgtgccc 
ctgtcgcccg 
aatggctctg 
ctgacccgcc 
atcgccaaca 
ttctgcatcc 
tccacggtgg 
ccgttcggca 
aacctgcaat 
cccgtaccgt 
ttaggggaag 
gttgcagacg 
caagaaaatg 
aaaaatgcag 
acagaagcat 
ccagagcaac 
ccaacttggg 
cctgctgctc 
cctccagttt 
ttactgatag 



ggtgaaaagc 
caaacacgcc 
aagtagcttg 
cagctcacag 
cgtgcccccc 
attgcgcaag 
cgccagtttg 
cgtgtcctgt 
ggtccagcct 
cttggtcgcc 
aaagccaccc 
taccgcaccc 
gttgagcgcc 
gttttcttcg 
gccacgctgg 
agccagacca 
cgccagtgcc 
aacccccgtg 
catcgcatcg 
gtccatcatc 
ggtcagcggt 
ctgaagcaac 
caatcggtcg 
ttcgaagtgc 
gtcaactgca 
atgatctgtc 
tcgcacagag 
cacaggcggt 
agtttgaaca 
tacagtgaac 
ttaaaaggat 
tgcgtacgcc 
cgcaaattga 
gcttagcgag 
tcgagaaaaa 
ggcttctgct 
tcgacttccc 
ttcgcaactc 
tcaacacttc 
acctcaacaa 
tgcgcgcgca 
acgagattca 
ctgcggttat 
tggaatcgac 
cgccactgcc 
tgatgccgct 
taggctgcat 
acagtcagca 
cgatgttctc 
ggcgcgtggc 
gcgcgaggct 
ttgctccgtg 
gctcatgttg 
acatccctga 
ctggtcttta 
cgcatagtgc 
aagtgtgtcc 
gatatgctgg 
gtgaaagaag 
ccccagacaa 



gacagtgacg 
tacgataaaa 
aaaaaagggg 
attgcttacg 
tcttctatat 
ctggcgccgg 
acggtctggt 
atgaacgact 
gcgaaaacgg 
gggtcggtga 
gccagggcca 
attgcgtcgg 
gcacccgagt 
ctcagttgag 
tgatcacgcg 
cccgccatcg 
ggagccaata 
tccagaacct 
tggagcctgt 
ttggtgccca 
ttgagcggag 
gaggccatgg 
agcttttccg 
aggcggctgg 
gcttggccat 
aggtaatcgg 
ctggaggcaa 
tgttggacgc 
gcgcagtgct 
tgtcgatcaa 
cgacgaaggc 
cactaccaag 
aattaagcga 
taagcgcctt 
atggcggtgt 
ctgggacgtg 
cctgatgccc 
gagtgcctat 
acgcagtttt 
ccctgtcaaa 
cctcaaaggc 
gcgcgccagc 
ctcgcaagcc 
catggtcgat 
ctacccctac 
gagcatggtc 
gctgctggca 
ccggatccgc 
ttcgccagag 
caacgcatca 
tatcgcaagt 
ccacgccagt 
ctgaagctgt 
ctgtcctgat 
ccgggctgca 
gcgtgctgtg 
agaagcatag 
aagcccatta 
gcctcatccg 
gcgtgcccgt 



atctgaagct 
cccaggcatc 
gtggactcgt 
accgcatagg 
cagcttcacg 
tatgggtgat 
cggctacgta 
cggctttttt 
ctgccgaacc 
tatttttcgt 
gaccgttttg 
tcgccatatc 
agctggccga 
ccttgggctc 
actgcacact 
atacaccaag 
cggtacgtac 
gtcgagcaag 
ccggcgaggc 
cctggtccat 
ccggcagcca 
caaagggcgt 
ccttggcgaa 
cgcgcgtctc 
cagccgaatc 
caatcgcatt 
gagacgcgtc 
gtcggttgat 
gaagcgggcg 
acagcgccag 
tgtgtggtcc 
gactgcgccg 
gctttaagga 
cttccaaacc 
ttcacccgaa 
gccgtcaccg 
ctcacgttgc 
aaccgttggt 
ggccggcagg 
gccatactct 
gacgtcaaaa 
cagagcaaca 
tttgccgccg 
ctgtccaact 
gtttatttcc 
accaccctgg 
atggaccgca 
atggaagacc 
aggcagccgc 
attggcggtc 
gaaagtctgc 
gcgtacctac 
ctgcctgaac 
gcagagccat 
acactgcttt 
ctctgcccag 
gtgctgcgtt 
ccctgggtag 
actgcccttt 
cgccacccgc 



ggcaacgggc 
gacccaacac 
cgagtccacc 
ccgaaacggt 
agccgggcgt 
cgcctccccg 
gcctgtggta 
caccgcggga 
tgccaggttg 
cgccatctcc 
ggtcaggctg 
cagtggcaga 
tttgattgct 
tttatccttc 
gagcaggcgg 
gtccacagca 
ggcgttgcgc 
gcttggcgag 
gctcaggtaa 
ggcgcccgac 
atcgcccttg 
cgcccgcaac 
ggtgtcggcg 
gatcagcgca 
ggccggcggc 
tatctcgcgt 
ggacgctgtc 
gtgcatggaa 
tgtccggagc 
aaacagcgaa 
cggatcggtt 
aaaaatcacc 
tggcagcgta 
agcaaaggag 
ccgtgaccta 
tggacgtcat 
tttgctcggc 
gggaagcgcg 
tactgacgct 
ttcaacgtca 
cagcaaaact 
acttccccaa 
gccagttcga 
gtcagggcgg 
cacggctgtt 
gctggttcac 
tcggtacaga 
tgtgcaacac 
tgctggctga 
tgagcaggca 
tctgggcacc 
gtcgcgcttg 
cacgccaaaa 
cgcatggcta 
gagatcgcga 
cccttttcca 
ctgcaacttg 
caatgcatcg 
tgcacggctc 
gcggccatag 



aacatcgcca 
accgagcttt 
ccctttttac 
atttcacttg 
tgacgcaggt 
cccatgtctt 
ctggatgcag 
tcggttgtca 
gtcaactgac 
tgcaacttgc 
gacgctgaca 
ccggccatcc 
ttataagcct 
aaaccgagca 
ttgccaaagc 
ccctgcacgg 
gccgagtacg 
tggcgcttca 
tgcagatcac 
agcgctccgg 
ttgatcgcag 
gcgcctgatg 
atggttgccg 
gtgatctgcg 
agtttatgcg 
tgctgatcgg 
cgaaagctat 
attccctctc 
gactacttgc 
acgtccggtc 
gacggttcca 
gtcgtttgtg 
agttcacaac 
tgccgcaatg 
cgttggctgg 
gctgatagaa 
actgatcgtg 
caccttgtgg 
gatcgatggc 
tgtggcttac 
cgacgggtta 
tgacatcctc 
cagcatccgt 
catggagcgc 
cagcacgctg 
cccggcgatc 
cctgcaagcc 
catcgaaaag 
cctgaaaagc 
gaaaaacagg 
atttcgctca 
aacacatcag 
agaggatcaa 
tcactcaaaa 
tcaaggtttt 
agtgtcatgc 
tttgaatagg 
ccctgatagt 
tgacaccaat 
tcagcagcaa 
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16381 
16441 
16501 
16561 
16621 
16681 
16741 
16801 
16861 
16921 
16981 
17041 
17101 
17161 
17221 
17281 
17341 
17401 
17461 
17521 
17581 
17641 
17701 
17761 
17821 
17881 
17941 
18001 
18061 
18121 
18181 
18241 
18301 
18361 
18421 
18481 
18541 
18601 
18661 
18721 
18781 
18841 
18901 
18961 
19021 
19081 
19141 
19201 
19261 
19321 
19381 
19441 
19501 
19561 
19621 
19681 
19741 
19801 
19861 
19921 



acgctctatc 
gctgacagcc 
tgctgttgtc 
atgaggcgat 
gtacctgctc 
cgaaaaagac 
aacgggcgaa 
gttgcgtctt 
ctctgtcagg 
ttctgggcgg 
gagacggcca 
gccattgagt 
gcgccacctg 
tcctgcggca 
ggcaaccaga 
tgcgcatcga 
cgcatggaaa 
ggttcccttt 
gccgccgatg 
cagggttgaa 
tactggttat 
agcgttgcat 
agagttggcc 
cctacgacct 
tcagtggagg 
ccccgacgct 
tggcatgcac 
gagtatcagt 
tcaccagagt 
gcctgcccat 
cgccgtccag 
acctccactc 
ccaggatcag 
aatgacctca 
catatgcaac 
gctgatatca 
acagcaactg 
atccagctac 
gatcagttcc 
tcattccgat 
gtactcaccg 
ccaggctgca 
gaagctcatc 
cagagcttca 
aggcgcgccc 
cgaaccttcg 
cggttgcagg 
gataaccaca 
cagctcaggc 
caggccaagt 
gacaataatc 
gccgccgtag 
cagagcgctg 
caaacgccga 
cgccgtccct 
cgggcagcga 
ataatcctga 
ttgtaatgca 
caggcgatgt 
caaagcagtc 



atcgatagtt 
gtgctcttgg 
cgccatagcc 
agagaccatc 
gggactggga 
ctcttcatgc 
cgcatccgac 
agcggcaacc 
tcatgaacgt 
tttttccggc 
atgtgctaat 
gcacactgcg 
tgcgagcagg 
tcgtcagcct 
gcagacactc 
tgcctcgatt 
ttccccgctc 
agggtttgca 
gcaaaggtaa 
tttgttgggt 
atcaatgtca 
ctatcccggc 
catacttata 
caccgctttt 
attgaacgtg 
tttggagagc 
ctcgtcaact 
gaacaggcgc 
ctttccaagg 
atcaccccgg 
aagtacgacc 
ctcgatcaag 
aacctgacaa 
tccacagtgg 
agcaaaggct 
ttcaagcacc 
ttaaaggctc 
gtgagcaagg 
gtcgaggact 
ggagcgtatt 
ccctgctcca 
aggttaggat 
tgaatatccg 
ttcagatagc 
gcgtcaccca 
gcgaagtgca 
cggccgtcgc 
tggggaccgt 
ggcgcgaggt 
tcttcgagta 
atgtgggtct 
agtgccgtga 
ccgactcacc 
ttatcttgcc 
gtgtgtcctg 
cctgtggtgc 
cgtcgttttc 
cattgtcatc 
aagcccgata 
actgtcaatc 



ttttcaaata 




ttgattctgg 
agatccggta 
agatcagcgg 
ccctccaatg 
gaaccggggg 
cctgattggg 
tcgtggggtc 
ttgctcctgg 
tcgcgtcatg 
caacaacagt 
ctccagattc 
ttcgatctgt 
gcttccattc 
gcgcagccac 
gtttaacgat 
ctaatatcaa 
cgggatgggc 
gacgttaaaa 
cttggcggct 
actcgccaag 
gacgtgccgt 
ctgcccgaaa 
ctcggttgat 
acaccaggga 
gcctgaaagc 
acggcgaaaa 
ccttgacctc 
gcatgcggat 
atgaggcatt 
cgggtaagaa 
ggcaattcag 
tcctgcgctg 
gcaaccagtg 
tgcaagccga 
atgccaagaa 
ctgatcggtt 
gaacagcgac 
gcaaggagcc 
gcgcctggcg 
gcggctgcag 
cgccactcaa 
ccagatagtt 
ggcgaccgac 
gccattgtac 
catgacggcg 
cttcgatcac 
tggttttcgg 
accacagaat 
ccgctgggtg 
atcgaatgtc 
gcgaagctga 
atcgctgaat 
ccccagaaat 
ttcaacgacc 
cagcattgtt 
gacgaagttg 
gtgatcggtc 
atccagataa 



gaaatttgct 
ttttggcttc 
tcttgatgta 
gcagggtacg 
catcgaccga 
ggacaaaggc 
cgagtccgga 
cgccagattg 
agatggacag 
cgtcgataat 
aggtgatcaa 
tcccttgaat 
agcgccattg 
gtgaagatga 
gcggtcctta 
tgataaagcc 
gattttcctc 
tgcgattctt 
agcgagtttt 
cgaaggaatg 
gctggagcct 
gttgggcgtg 
tttccctcgc 
atcttggcgg 
ccatattttt 
ttcaaacccg 
cgcaacgtaa 
attcctgcgc 
ttgatgcgct 
cacgcgaaag 
cacccttggc 
acctgaagcc 
taatgatctg 
gcgaaaacac 
cacaccacca 
gtagaagcac 
agccagcgct 
gaagctggcg 
gtttacgcgc 
tgttcaacag 
atacgcaggt 
cattccctgc 
ttcgtcgccc 
ggccagttca 
gtacaggttg 
gtactcatcg 
gatcaggtaa 
cggggatttg 
gtcgcgctgg 
gcgctgcgaa 
agagtgggat 
ctctggcgac 
cgctccactg 
gtgtagaaca 
cgaccctgtg 
agcaaatcga 
ttgtagccgg 
cccaactggt 
aggttcatgg 
cccgcacagt 



ctggtgaaac 
gatgttcgca 
ttgcgtggcg 
caacgaatga 
cgaaaaggaa 
gcccgccttt 
caatgacgag 
ctggatatac 
ccggtaagaa 
cttccagata 
gtccggtctc 
cagggttata 
ccagaatcaa 
acaacgaagt 
cgttgtggcg 
gatcttttgc 
tgtggttcaa 
gtaaaaatcg 
tggtaacgtt 
tatgcttaaa 
gatgattcat 
gggaacccat 
gttggacaca 
tgatgaccgc 
actgcgacag 
ccttaaaagc 
gtaaaatttt 
cgcatgctcc 
tgcgacgtat 
gcctccgata 
ctcgaatcga 
ctctgctact 
gacttctacc 
gagcaggtct 
gaaccgggtt 
atgaaccgtc 
aaactggcac 
gcagagtccg 
caccggtatg 
ctcacttact 
ctttcctggc 
attttggcga 
agcagataag 
gagtgaatgc 
agcatcagcg 
taggtggcgc 
tcgacgatgg 
cccagcggat 
tagcgtttta 
cgtgagttgt 
gtctagaaaa 
ctcagacgcg 
ccgctttatc 
cattttcgga 
gcgagcagtt 
tcttgaaacg 
aaaggctgat 
gccaactacg 
cgcgccctcc 
tttaacagag 



gggtggacaa 
gtcgcgccta 
ccgtcacgta 
agctggggtt 
gagcgcgcat 
tcgggatgaa 
ggcttatcgt 
ataaaccgcc 
ccgaggctct 
gcgctgcaac 
atccagatcc 
gccaagcgca 
aatgacgttg 
gtcctgttct 
ttgaccctcc 
ctcgacaggc 
gacgtgatgc 
actcgtgagt 
gccgttgttg 
aaatgcctgc 
ctggacggcg 
aacggagggc 
ctgctgctgc 
aaatgtcagt 
aagagtgcgg 
tttatatgcg 
gctccgctcg 
acaagtcgat 
aaccgtcgta 
cctgccagag 
ttctttccgg 
gcctcttgct 
tgtttcatct 
ggacagaatg 
cgacagttaa 
gcaagaaaat 
cggcaaacaa 
gtaacgaccc 
gtcaggctgt 
tcgcaaacga 
atcgttgtac 
attcgccaat 
gcgtcagccc 
gcggatgcaa 
gcagaatggc 
tggcaggatc 
cgccagactc 
gaatggcctt 
tctcgtacgg 
tcaggtggtg 
agactgctgg 
tctgtcggcg 
gattaccgac 
aaaggtgatg 
gaagaccagc 
caagtcgggg 
cagctcaccg 
gtcattcaga 
ttcaggtgct 
tcatagggaa 
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19981 ctcgtgcggc cgacatcgcc ctaagcctca catctatgta ctggcgcgac gctggtttca 
20041 agcgaaggac tt 

// 
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AF232004 52498 bp DNA linear 

Pseudomonas syringae pv. tomato strain DC3000 Hrp 
island, complete sequence. 

AF232004 AF061028 AF061029 AF232006 L41861 
AF232004.3 GI: 13325077 



BCT 05-MAR-2001 
pathogenicity 



Pseudomonadales ; 



syringae, glycinea, 
Yersinia ysc 

not 



Pseudomonas syringae pv. tomato 
Pseudomonas syringae pv. tomato 
Bacteria; Proteobacteria; Gammaproteobacteria; 
Pseudomonadaceae; Pseudomonas. 

1 (bases 25494 to 29778) 

Preston, G., Huang, H.C., He,S.Y. and Collmer,A. 
The HrpZ proteins of Pseudomonas syringae pvs. 
and tomato are encoded by an operon containing 
homologs and elicit the hypersensitive response in tomato but 
soybean 

Mol. Plant Microbe Interact. 8 (5), 717-732 (1995) 
96025089 
7579616 

2 (bases 22134 to 25847; 29687 to 32670) 

Deng,W.L., Preston, G. , Collmer,A., Chang, C.J. and Huang, H.C. 
Characterization of the hrpC and hrpRS operons of Pseudomonas 
syringae pathovars syringae, tomato, and glycinea and analysis of 
the ability of hrpF, hrpG, hrcC, hrpT, and hrpV mutants to elicit 
the hypersensitive response and disease in plants 
J. Bacteriol. 180 (17), 4523-4531 (1998) 
98389667 
9721291 

3 (bases 31672 to 51723) 

Charkowski, A. O . , Alfano,J.R., Preston, G. , Yuan, J 
Collmer, A. 

The Pseudomonas syringae pv. tomato HrpW protein has 
similar to harpins and pectate lyases and can elicit 
hypersensitive response and bind to pectate 
J. Bacteriol. 180 (19), 5211-5217 (1998) 
98422476 
9748456 

4 (bases 901 to 22404; 31672 to 51723) 
Alfano,J.R., Charkowski, A. O. , Deng,W.L., Badel,J.L., 
Petnicki-Ocwie j a, T . , van Dijk,K. and Collmer, A. 

The Pseudomonas syringae Hrp pathogenicity island has a tripartite 
mosaic structure composed of a cluster of type III secretion genes 
bounded by exchangeable effector and conserved effector loci that 

to parasitic fitness and pathoge 
Acad. Sci. U.S.A. 97 (9), 4856- 



He,S.Y. and 



domains 
the plant 



contribute 
Proc. Natl 
20243785 



10781092 



nicity in plants \ $ 

4861 (2000) t^l^' fsT 
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REFERENCE 5 (bases 1 to 52498) 

AUTHORS Ramos, A. R., Rehm, A.H. and Collmer,A.R. 

TITLE Pseudomonas syringae pv. tomato DC3000 hrpL through hrcU 

JOURNAL Unpublished 
REFERENCE 6 (bases 1 to 52498) 

AUTHORS Alfano,J.R. and Collmer,A. 
TITLE Direct Submission 

JOURNAL Submitted ( 07-FEB-2000) Dept. Biol. Sci . , UNLV, 1854 Maryland 

Parkway, Las Vegas, NV 89154, USA 
REFERENCE 7 (bases 1 to 52498) 

AUTHORS Ramos, A. R., Rehm,A.H. and Collmer,A.R. 
TITLE Direct Submission 

JOURNAL Submitted (22-NOV-2000 ) Plant Pathology, Cornell University, 334 

Plant Sciences Bldg., Ithaca, NY 14850, USA 
REMARK Sequence update by submitter 
REFERENCE 8 (bases 1 to 52498) 

AUTHORS Ramos, A. R., Rehm,A.H. and Collmer,A.R. 
TITLE Direct Submission 

JOURNAL Submitted ( 05-MAR-2001 ) Plant Pathology, Cornell University, 334 

Plant Sciences Bldg., Ithaca, NY 14850, USA 
REMARK Sequence update by submitter 
COMMENT On or before Mar 14, 2001 this sequence version replaced 

gi:3228544, qi: 790906 , qi : 3228541 , qi: 8037790 , qi: 11276506 . 
FEATURES Location/Qualifiers 
source 1. . 52498 

/organism="Pseudomonas syringae pv. tomato" 

/strain="DC3000" 

/db_xref ="taxon : 323" 

/note="exchangeable effector locus flanking the hrp/hrc 
cluster 

pathovar: tomato" 
gene complement ( 908 .. 1972 ) 

/gene="queA" 
CDS complement (908 . .1972) 

/gene="queA" 

/codon_start=l 

/transl table=ll 

/product="QueA" 

/protein id=" AAF71483 . 1 " 

/db_xref="GI : 8037772" 

/ translation="MRVADFTFELPDSLIARHPLAERRSSRLLTLDGPTGALAHRQFT 
DLLEHLRSGDLMVFNNTRVI PARLFGQKASGGKLE I LVERVLDSHRVLAHVRASKS PK 
PGSSILIDGGGEAEMVARHDALFELRFAEEVLPLLDRVGHMPLPPYIDRPDEGADRER 
Y QTV Y AQRAGAVAAPT AGLH FDQP LME AI AAKGVE T AFVT LHVGAGT FQ PVRVE Q I E D 
HHMHSEWLEVSQDWDAVAACRARGGRVIAVGTTSVRSLESAARDGQLKPFSGDTDIF 
I YPGRPFHWDALVTNFHLPESTLLMLVSAFAGYPETMAAYAAAIEHGYRFFSYGDAM 
FITRNPAPTAPQESAPEDHA" 

tRNA 2090. .2177 

/product="tRNA-Leu" 

CDS 2343. .2753 

/note="ORF4" 
/codon_start=l 
/transl table=ll 
/product= "unknown" 
/protein id=" AAF71484 . 1 " 
/db_xref="GI : 8037773" 

/ translation="MNKIVYVKAYFKPIGEEVSVKVPTGEIKKGFFGDKEIMKKETQW 
QQTGWSDCQIDGERLSKDVEDAVAQLNADGYEIQTVLPILSGAYDYALKYRYEIRHNR 
TELSPGDQSYVFGYGYSFTEGVTLVAKKFQSSAS " 
CDS complement (2908 . . 3324) 
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/note="ORF3" 
/codon_start=l 
/transl table-11 
/product^ "unknown" 
/protein id=" AAF71485 . 1 " 
/db_xref="GI : 8037774" 

/translation="MRAYKNLTAKIGGFLLALTIIGTSLPAFAVNDCDLDNDNSTGAT 
CGGNDKDLDNDNVTDAAFGGNDKDMDNDHHTDAAFGGNDKDLDNDHHTDAAFGGNDKO 
LDNDNKTDAAFGGNDRDLDNDNNTDNYNGTPSAAKK" 

CDS complement (3939 . . 4664) 

/note="ORF2 " 
/codon_start=l 
/transl table=ll 
/product= "unknown" 
/protein id=" AAF71486 . 1 " 
/db_xref="GI : 8037775" 

/ translation="MQKTTLWALAFAMLAGCGVSGPAPGSDIQGAQAEMKTPVKLNLD 
AYTSKKLDAVLEARTNKSYMNKGQLIDLVSGAFLGTPYRSNMLVGSANVPEQLVIDFR 
GLDC FAY LDY VE AFRR S T SQQD FVRNLVQVR YKGGDVD FLNRKH F FT DWAY GT AY PVA 
DDITAQISPGAVSVRKRLNERAKGKVYLPGLPWERSMTYIPSRLVDSQWSHLRTGD 
YIGI YTPASRAGCDTRRFLYRDG" 

gene 5571 . . 6410 

/gene="tnpA" 

CDS 5571 . . 6410 

/gene="tnpA" 

/note="truncated at the amino terminus" 

/ codon_start-l 

/transl_table=ll 

/product="TnpA" 

/protein id=" AAF71487 . 1 " 

/db_xref="GI : 8037776" 

/ translation="MGFLRPAYPLMAMLRIHLMQNWFDYSDPAMEEALYETTILRQFA 

GLSLDRIADETTILNFRRLLEKHELAGGILQVINGYLGDRGLMLRQGMWDATIIHAP 

S STKNKDGKRD PEMHQTKKGNQ Y F FGMKAH I GVDAE S GLVHS LVGTAANVADVTQVDQ 

LLHSEETYVSGDAGYTGVDKRAEHQDRQMIWSIAARPSRYKKHGEKSLIARVYRKIEF 

TKAQLRAKVEHPLRVIKRQFGYTKVRFRGLAKNTAOQATL^ 

VRL" 

CDS complement ( 6656 .. 8056) 

/note="ORFl" 
/ codon_start=l 
/transl tableau 
/produc t="un known" 
/protein id=" AAF71488 . 1 " 
/db_xref="GI : 8037777" 

/ translation="MRPVGGPAPGYYPPTYEAERPTAQAAGNDRARSSQASSSPAASV 
APET PMLGDLKR F PAGRY PDMKVENI RLK I E GQE PGGKDGVKHTRRRKPDAAGS S HVH 
GGQSVASTSASAQSKALQDTNFKASDLAELARWCESPHPYALAPSKAAGKSSQLSAW 
VSILLQEGKHALEQRLEAQGLKLADVWSEGRDHLHINLNYLEMDSCLGTSKGLWAPD 
SNDKKLI AKAARY FDDFNAQKLPE LAPLTKMKSKDS LGVMRE LLRDAPGLVI GE GHNS 
TSSKRELINNMKSLKASGVTTLFMEHLCAESHDK7VLNNYLSAPKGS PMPARLKNYLDL 
QSQGHQAPEELHTKYNFTTLVEAAKHAGLRWSLDTTSTYMAPEKAEIKRAQAMNYYA 
AEKIRLSKPEGKWVAFVGATHATSCDGVPGLAELHGVRSLVIDDLGLKSRATVDINVK 
NYGGKLNPDVRLSYKV" 

gene complement ( 8131 .. 10473) 

/gene="hrpK" 

CDS complement (8131. . 10473) 

/gene="hrpK" 
/codon_start=l 
/transl table=ll 
/product="HrpK" 
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/protein- id=" AAF71489 . 1 " 
/db_xref="GI : 8037778" 

/translation="MRISSSPEVIWQPTPGELALAVESPLAKALPTPVGGGGQAGVQ 
FGQPAGNTQGAPTGAEQTASSILALLLQGSGGAAANANVNNQVQPQPCADSPAEAAPT 
EATAPAAPVE QVAAPTNT DAT DASNAPKAADAAF LDNS EYSS PEALKRWE PMVAH L P P 
E E R E QAAKE LNR P I AAAWMAR DD G P DAGKAMD F I NAN P AL KT AVD VAQ GGGNAD GK I T 
NKDLKTFAKNMEKAADSADKDLANYTKDNPDADP 

HHAVDAADKTKVDGNVDAEDLKGLAQSNPGLSGALKQSCSTWSQPGFLGQVDEAGMSG 
RKKAAHS PDKMFDAKNLSEWI KKSAPTNGGQFASMLSDSATLNAVAGIDI SKLDKDVF 
DKPKSYSGAQKAAVMVKLQQTQQSVIAGRSLRNTDKTEQGLNDRISQLQADPDVQAYL 
NKSIPEQERNLVRSDASLQKAWEQTKNVNSGQALQTDMDKADKAVNKHNPNADYSGA 
ISGLSAQLQLQKDLFPDSKVPTTNQVLENKPDLQDKIATSYVTNFSEGGALKQLLGQK 
KSDASQSLQTADNQKAIYDSVLPPDFTENQHENYMNSTLGTLQDSKKGRKLLEGKSDE 
GGPSMLTQLAEAGVGGKALNS I VGFAS VS S LI AHGDKLGATQAI YDSTRMGAEAI KGG 
I DAGAKMMGREAS AGLGR LGGQMI GRAVGMVAGE ATGLAAGAALGAAI PWGWAI DGA 
LAVGFGI SMI IDAVKKHKAQKAFDHNVDPVLDQFGI AKAH " 

gene complement ( 10560 .. 11114 ) 

/gene="hrpL" 
CDS complement (10560. . 11114) 

/gene="hrpL" 

/note^'alternative sigma factor" 

/codon_start=l 

/transl_table=ll 

/product="HrpL" 

/protein id=" AAG33875 . 2 " 

/db_xref="GI: 13325078" 

/ translation="MFQKIVILDSTQPRQPSSSAGIRQMTADQIQMLRAFIQKRVMNP 

DDVDDILQCVFLEALRNEHKFQHASKPQTWLCGIALNLIRNHFRKMYRQPYQESWEDE 

VHSE LE GHGDVSHQVEGHRQLARVI QAI DCLPSNMQKVLE VS LEMDGNYQETANS LGV 

PIGTVRSRLSGARVQLKQQIDPFA" 
gene 11358 . . 12464 

/ gene-"hrpJ" 
CDS 11358.. 12464 

/gene="hrpJ" 

/codon_start=l 

/transl table=ll 

/product="HrpJ" 

/protein id=" AAG33876 . 1 " 

/db_xref="GI : 11276508" 

/translation="MKIVAPPIMRILPVAPTRWTPAAQPLPN7\DLHNSGTSPQQVSR 
FA7\ALIQHRRILIQRDLIASNNALQSRAVKLGELYQLLMTDQDTGLDNAARVLRKKLM 
QDDSSSDLKQVLDYTDGDA7UCAHWLQA7\RKQAEADGEMGEHWLTQQLKQLRRKYGP 
RARAGINSAKAFARPNIDNKRRVALRNLYSVAVSGQPNITGLIEALLGEQQEAGQFNI 
DLRDMRTAIADDLSA^PSASHEQLRTLMHGLNTARHVTTLLKGCEHLLGRMRNKNPD 
LTVDPPAFLKHLLTLTANGMNVNQTLQLTQHIGGKQLKHQLAFLNGLRPMLLQLPILL 
WRDMKSRQTALSNLLILMAELTTQEQKQLYGGLV" 

gene 12464.. 14554 

/gene="hrcV" 

CDS 12464.. 14554 

/gene="hrcV" 

/note="contains 6 transmembrane domains; likely inner 

membrane component of type III protein secretion." 

/codon_start=l 

/transl_table=ll 

/product="HrcV" 

/protein id=" AAG33877 . 1 " 

/db_xref="GI : 11276509" 

/translation="MNQIINFLhnWALSAMRRSEVVGAFFVIAIVFMMITPLPTGLVD 
VLIAVNICISCLLIMLAMHLPRPLAFSTFPAVLLLTTMFRLALSISTTRLILLNQDAG 
HIVEAFGQFWGGNLAVGLVIFLILTWNFLVITKGSERVAEVGARFTLDAMPGKQMS 
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IDSDLRANLISVYEARNRRSELNKESQLFGAMDGAMKFVNGDMASLIXVAINMIGGI 
S I GWQHGMTAGDALQLYTVLT I GDGLIAQI PALLI SVTCGMI ITRVPNTEAGVEANI 
GREIAEQITSQPKAWIIAAVAMLGFAALPGMPTGVFITIAIICGAGGMLQLQRAKPKA 
EEQGAVAVAPGDEWQGKTLRTFS PSRQFVLQFHPGQNSALVDALVSE I RQRRNRLWQ 
FGLTLPS FI I E YVDHLQPDE FR FTVYDVPMLKAT FTQTHVAVDVRQFNGE DVPAAI SG 
TTDRQEDQWVWLPAEQGGELATVSSMTLITERMERALQSCGPQFIGLQETKAILGWLE 
SEQPELAQEMQRVLTLTRFSAVLQRLASECVPLRAIRVIAETLIEHCQHERDTNVLTD 
YVRIALKSQIYHQYCGADGLQVWLLTPESEGLLRDGLRQTQTETFFALSNDISQMLVQ 
QLHIAFPLRAPEQAVLLVAQDLRSPLRTLLREEFYHVPVLSFAEISNAAKVRVMGRFD 
LEDDLEPMDNEHAA" 

gene 14588. .15580 

/gene="hrpQ" 

CDS 14588. .15580 

/gene="hrpQ" 

/note="likely involved in type III protein secretion" 

/codon_start=l 

/transl_table=ll 

/product="HrpQ" 

/protein id=" AAG33878 . 1 " 

/db_xref="GI: 11276510" 

/ translation="MFELRVLNGQHQGAALPLIGEQWSIGSAQQQDLALDDSGVESLH 
CRLQRLDDSWTLNAEEGAVCDEEGHSQASVDLTLNTAFMLGSVWLCVSPAGDEWPSVP 
AVIPKQPQAEAEPARHEAPLEKVKSRSQFLNRTTGIIAGLLVGVIGSAWSLTRPAAIV 
LDQGPAHIASAMTTAPGMPNAAPKTSQPAAKPAGPATDKRVRLASMEAVRHQLTTMLS 
DRMLTDVSVAETPEGLMLNGHLKEESLWYQRMLQRFKDRYDSPVTVLDNVASARNTL 
P FWVQ IMT GPHAH LVTADGRRMYVGDE KDGLRLTKI DDQRLQFDGDRH I EVTW " 

gene 15667 . . 16926 

/ gene="hrcN" 

CDS 15667.. 16926 

/ gene="hrcN" 

/note="contains ATPase domain; likely inner membrane 

component of type III protein secretion" 

/codon_start=l 

/transl_table=ll 

/product="HrcN" 

/protein id=" AAG3387 9 . 1 " 

/db_xref="GI : 11276511" 

/ 1 ranslation="MRGILLECKI PAAPCVGDLCEVSKADGSFLLAE IVGFTQECTLLS 
ALGAPDGIQVGAPIRPLGIAHRIGVDDTLLGCVLDGFGRPLLGDCLGAFAGPDDRRDT 
LPVIADALPPTQRPRITRSLPTGIRAIDSAILLGEGQRVGLFAGAGCGKTTLMAEIiAR 
NMDCDVIVFGLIGERGRELREFLDHELDETLRARSVLVCATSDRSSMERARAAFTATA 
IAEAFRARGQKVLLLLDSLTRFARAQREIGIASGEPLGRGGLPPSVYTLLPRLVERAG 
MSENGS I TALYTVLIEQDSMNDPVADEVRSLLDGHI VLSRKLAERGHY PAIDVSAS I S 
RILSNVTGRDHQRANNRLRQLLAAYKQVEMLLRLGEYQAGADPVTDCAVQLNDAINAF 
LRQDLREPVPLQETLDGLLRITSQLPE " 

gene 16926. . 17375 

/gene="hrpO" 

CDS 16926. . 17375 

/ gene =,, hrpO" 

/note= M likely involved in type III protein secretion" 

/codon_start=l 

/ trans It able=ll 

/product= M HrpO n 

/protein id=" AAG338 80 . 1 " 

/db_xref="GI : 11276512" 

/translation="MMDETLEEDPQREALEQVISLLTPLRQHRQASAERAHRQAQVEL 
KSMLDHLSETRASLDQERDNHKRRRESLSQDHLQKTISLNDVDRWHEKEKNMLDRLAF 
I RQDVQQQQ LR VAE QQT LLE HKR LQAKAS QRAVE K LACME ETLNEEG" 
gene 17382.. 17951 

/gene="hrpP" 
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CDS 17382.. 17951 

/gene="hrpP" 

/note="likely involved in type III protein secretion" 

/ codon_start=l 

/ transl table=ll 

/product="HrpP" 

/protein id=" AAG33881 . 1 " 

/db_xref="GI : 11276513" 

/translation="MTAPIKTPAKAPPAPQRPAPVERPATALSEKPFGEKRPASGGLG 
RGDVQTSRSVLSKSLQDT PMSADGMFFSQLLVPPVSAEPGQQGFGGGGIAFPVRAEQV 
PTQLIDELAQRLPDQPDGPLSFTLLMPSLGSVRVNANKTENRWSVQLGFARRDVLKRL 
SAHTGACRDSLSQALGQDVELDMHEDLSA" 

gene 17942.. 18664 

/gene="hrcQa" 

CDS 17948.. 18664 

/gene="hrcQa" 

/note="likely involved in type III protein secretion" 

/codon_start=l 

/transl_table=ll 

/product="HrcQa" 

/protein id=" AAG338 82 . 1 " 

/db_xref="GI : 11276514" 

/ translation="MSALCLRKVDPLLAICVSRELGAGQCLSFSAKGQQQELTLLPLIA 
SDETPAKGVWLNTAVGALCLSDAEALLSLLGDIPLTLGGEQQAWYWQFFNQRLSPTIA 

DLLAPVE PLPQACAE LTVGCR I QVRLADQQVHAHLHAT PETLLTLLRS AAWQARLKPL 
DERWSISTPLILGELSLTLEQLASLRPGDVLLPANCKFDSAGQGFLTLAGRQWAAQTD 

SQDQHLLLRLSHEEHSHHEY" 
gene 18654 . . 19067 

/ gene="hrcQb" 
CDS 18654. . 19067 

/ gene= " hrcQb " 

/note="likely involved in type III protein secretion" 

/codon_start=l 

/ transl table=ll 

/product="HrcQb" 

/protein id=" AAG33883 . 1 " 

/db_xref="GI : 11276515" 

/ translation="MSTEDLYQDDVESLEDYDDETAEQEHEHEHEQQWAEPDDESEYA 

EAEPDDDEQEEQEEQQAPSGLDSLALDLTLRCGELRLTLAELRRLDAGTILEVGGVAP 

GYATLCHGERWAEGELVDVDGRLGLQITRLAAQP" 
gene 19091.. 19717 

/gene="hrcR" 
CDS 19091.. 19717 

/gene="hrcR" 

/note="contains 3 transmembrane domains; likely inner 

membrane component of type III protein secretion system" 

/codon_start=l 

/transl table=ll 

/product="HrcR" 

/protein id=" AAG338 84 . 1 " 

/db_xref ="GI : 11276516" 

/translation= M MLALFLGAMSLIPFLLIVCTAFLKIAMTLLITRNAIGVQQVPPN 
MALYGIALAATMFVMAPVAHEMQQRVHDHPLELGNTEKLQASARTVIEPLQRF1W 
DPDWAHLLDNTQRMWPKEMADQASKNDLLLAIPAFVLSELQAGFEIGFLIYIPFIVI 
DLIVSNLLLALGMQMVSPMTLSLPLKLLLFVMVSGWSRLLDSLFYSYM" 

gene 19725. . 19991 

/gene="hrcS" 

CDS 19725.. 19991 

/gene="hrcS" 

/note="contains 2 transmembrane domains; likely inner 
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membrane component of type III protein secretion system" 

/codon_start=l 

/transl table- 11 

/product="HrcS" 

/protein id=" AAG33885 . 1 " 

/db_xref="GI: 11276517" 

/translation="MEALALFKQGMFLWILTAPPLGVAVLVGVITSLLQALMQIQDQ 

T L P F GI K LAAVGMT LAMT GRWI GVE L I Q F I NMAFDL I AR S GWH " 
gene 19992.. 20786 

/gene="hrcT" 
CDS 19992.. 20786 

/gene="hrcT" 

/note="contains 3 transmembrane domains; likely inner 

membrane component of type III secretion system" 

/ codon_start=l 

/transl table=ll 

/product="HrcT n 

/protein id=" AAG33886 . 1 " 

/db_xref="GI : 11276518" 

/ transl a t i on= "MPLDAQNFFDLI LGMGLAMARLVPCMLLVPAFCFKY LKGPLRYA 
WAWAMIPAPGISRALTSLNDDWFAI GGLLLKEWLGTLLGMLLYAPFWMFASVGAL 
LDSQRGALSGGQINPSLGPDATPLGELFQETLVMLVLISGGLSLITQVIWDSYMVWPP 
TSWLPGMTAEGLDVFLGQLNQTLQHMMLYAAPFIAIjLLLI EAALAI IGLYAQQLNVS I 
LAMPAKSMAGI AFLLVY LPTLLE LGTGE LSKLADLKS I LGFWQVP " 

gene 20927. .21862 

/ gene="hrcU" 
CDS 20927. .21862 

/gene="hrcU" 

/note="contains 4 transmembrane domains; likely inner 

membrane component of type III protein secretion system" 

/ codon_start=l 

/transl table=ll 

/product="HrcU" 

/protein id=" AAG338 87 . 1 " 

/db_xref="GI : 11276519" 

/ translation="MNRLQALIiALTFKGIERPFMSAVELIASEGLSVVLSFTLCSVGL 
AMLMRLISSWVQIGFLFAPK7U.KLDIKKIDPFSHAKQMFSGQNILNLLLSILKAVAIG 
ATLYTQVKPALGTLILLANSDLATYLHALIELFQHVLRVILGLLLVIALIDFAMQKYF 
HAKKLRMSHEDIKKE YKQSEGDPHVKGHRRQLAHE I LNQE PSAAPKPVEEADMLLVNP 
THYAVALYYRPGETPLPMIHCKGEDEDALALIAQAKKAGIPWQSIWLARTLYKVWG 
KYIPRPTLLAVGHIYKWRQLEEITDEVIRIDDDM" 

gene complement (22405 .. 25713) 

/gene="hrpC operon" 

gene complement (22405 . .22764) 

/gene="hrpV" 

CDS complement (22405. .22764) 

/gene="hrpV" 

/function= "negative regulator" 

/codon_start=l 

/transl table=ll 

/product="HrpV" 

/protein id=" AAC34758 . 1 " 

/db_xref="GI : 3228549" 

/translation="MIEVTEKSAFYAQVAAQSPAVWPVANGVAFVSRREHHDWGIALH 
IEGRALRPDQLRDALQRRET4ESERFNHYFLFLDVRRDFVVWHAVNEKPGSYASLDDIH 
RHELMLAGLDHLSEEMH" 

gene complement (22767 . .22970) 

/gene="hrpT" 
CDS complement (227 67 . .22970) 

/gene="hrpT" 
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/note="Hrp ( type III) secretion pathway associated protein" 

/ codon_start=l 

/transl table=ll 

/product="HrpT" 

/protein id=" AAC34757 . 1 " 

/db_xref="GI : 3228548" 

/translation="MMIRSLTLLLLLFATLTGCATHGCAGNACKRPDSNNRELVIWWP 
PDMREGLDDRDHERDYTWQLKD" 

gene complement (22966 .. 25069) 

/gene="hrcC" 
CDS complement (22967. .25069) 

/gene="hrcC" 

/note="outer-membrane protein of Hrp(type III) secretion 

pathway; formerly designated HrpH" 

/codon_start=l 

/transl table=ll 

/product="HrcC" 

/protein id=" AAC34756 . 1 " 

/db_xref="GI : 3228547" 

/translation="MRK7UjMWLPLLLIGVIPATWAVTPEAWKHTAYAYDARQTELSTA 
LADFAREFGMSLDMS PVQGNLDGRIRAQNPEEFLERLSQEYHFQWFVYNDTLYVSPSS 
E HT S GR I E VS PDAVDDLQT ALT DVGL VDKR L GWGS L PDE GWLVRG PAK Y VE FVRD Y S 
KKVEKPDEKADKQDVWLPLKYANAADRTIRYRDQQLWAGVASILQELLESRSRGES 
I DS VNLLAGQGS S VANSTGVAAPACLTTWAPMVS I REHCNRGI APR I ELQQQKNCQGS 
CLRQGNIRVSADVRNNSVLIYDLPERKAMYQKLVKELDVPRNLIEIDAVILDIDPNEL 
AELSSRLEFQCPAASEVGANLFDGGTSSTLFLQNASKFSAELHALEGNGSASVIANPS 
ILTLENQPAVIDLSRTEYLTPTSERAADILPITAGTSLQVIPRSLDNDGKPQVQMIVD 
IEDGQIDVSTINDTQPSVRRG>TVSTQAVIAEHGSLVIGGFHGLEANDRIHKIPLLGDI 
PYIGKLLFQSRSRELSQRERLFILTPRLIGDQVNPARYVQNGNPHDVDDQMKKIKERR 
DGGRLPTRGDIQKVFTQMIDGPAPEGLRAGQTLPFETRSLCDPGEGLTLDGQRSQWFV 
K KDW GV AVWARNNT DKPVRIDESRC GGR WV I GVAAW P HAWL Q PGEESEVYI AVR Q PQ 
ISKMAKESRPSLLRGAKP" 

gene complement (25071 25502 ) 

/gene="hrpG" 

CDS complement (25071. .25502) 

/gene="hrpG" 

/note="Hrp ( type III) secretion pathway associated protein" 

/codon_start=l 

/transl table=ll 

/product= n HrpG" 

/protein id=" AAC34755 . 1 " 

/db_xref ="GI : 3228546" 

/translation="MEFSEFTEIVGQWSAQRPAMPLDCWIDDANARLAWGDGVRCSI 
ELLDPYDAGDPQRLEALLGEGGASLACACDGALAIDPQTRCVVLVSWM7\NPCSLGDLL 
DRLESLANQRAGMLSLMQTTIRNTTPTLSGRSILNHRQPGV" 

gene complement (25489 25713 ) 

/gene =,, hrpF" 

CDS complement (25489. .25713) 

/gene="hrpF" 

/note="Hrp ( type III) secretion pathway associated protein" 

/codon_start=l 

/transl_table=ll 

/product="HrpF" 

/protein id=" AAC34754 . 1 " 

/db_xref="GI : 3228545" 

/ translation="MISFNSLQRHLDNSVSRAHTNMDQAAMEASESGTVEDLQAFTMS 
QQQVDVAGIAVNECLRAKHGINKAVIDGIQ" 

gene complement (25808 . .26389) 

/gene="hrpE" 
CDS complement (25808. .26389) 
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/gene="hrpE" 

/no te= "putative" 

/codon_start=l 

/ transl table=ll 

/product="HrpE" 

/protein id=" AAB00131 . 1 " 

/db_xref="GI: 790912" 

/translation= M MLAKRSIALTAATLLREPILRREDIADSLLARDILADARQQATQ 
IIlALEQEKAEHLQQQAIlAQE T WENANAFLGELQVQREALQEQA^4TAVEELLSESLRHLL 
DDTTLAERARALARNLAASQLNEAVATLSVHPQIADPVAEWLADSRFSEHWQLKRDAT 
IASDSLRLSDANGAFDIAWADLRKGLLGVEPAA" 

gene complement (26425 .. 26826) 

/gene^'hrpD" 
CDS complement {26425. .26826) 

/gene="hrpD" 

/codon_start=l 

/transl table-11 

/product="HrpD" 

/protein id=" AAB00130 . 1 " 

/db_xref ="GI : 790911" 

/translation =l, MASRHGVFLQSLGIDPAQPPAPAE PVLRWLALTPSQREQALSLA 
QCICFSRNESDGPDGQWCWGLTKALRPGVWLEFEHEDARLLLGAWLGPQYWSRLCLEC 

PPNEVPDTPGKAPENKLQALWQAIMWRVTAA" 
gene complement (26952 .. 27758 ) 

/gene= M hrpC l, 
CDS complement (26952. .27758) 

/gene="hrpC" 

/note= ,, putative M 

/codon_start=l 

/ transl _table=ll 

/product="HrpC" 

/protein id=" AAB00129 . 1 " 

/db_xref="GI : 790910" 

/translation="MNFLSAGLLLLCMLLLGGCSDETDLFTGLSEQDSNEWARLADQ 
HIDARKRLEKTGVWTVATSEMNRAVRVLDAAGLPRRSRTTLGEIFKKEGVISTPLEE 
RARYIYALSQELEATLSQIDGVIVARVHWLPERIAPGE PVQPASAAVFIKHSAALDP 
DSVRGRIQQMVASS I PGMSTQSVDSKKFS I VFVPAAE FQETTQWVS FGPFKLDSTNLP 
FWNLMLWVAPVGLALVLLI GALLVRS DWRAS LLRRI GFGSRGRSTLPARA" 
misc feature complement ( 26955 ) 

/gene="hrpC" 

/note="possible alternative translational start site for 
hrpD" 

gene complement (27767 .. 28141 ) 

/gene="hrpB" 
CDS complement (27767 . .28141) 

/gene="hrpB" 

/codon_start=l 

/evidence=experimental 

/transl table=ll 

/product="HrpB" 

/protein id=" AAB00128 . 1 " 

/db_xref="GI : 790909" 

/translation="MTISQLSNLKSVSPELGQNAHQGLGSEPVQADVDLFNAAMRPDS 
GPAASHLSDRIASALSERLGSTEKLSQQALRQMKKVSNTEDPGDIVQMSRALSQCSLQ 
MALTTKWSKS AQALDKLTNLQ " 

gene complement (28169 . .29281) 

/gene="hrpZ" 
CDS complement (28169 . .29281) 

/gene="hrpZ" 

/codon start=l 



http://www.ncbi. nlm.nih.gov/entrez/quei7-fcgi?cmd=Retrieve&db=nucleotide&list_m 2/4/03 



/evidence=experimental 
/transl table=ll 
/product="HrpZ " 
/protein id= M AAB00127 . 1 " 
/db_xref="GI : 790908" 

/translation="MQALNSISSLQTSASLFPVSLNSDVSANTSTSSKELKAVIDQLV 
QALTQS GQLDETS PLGKMLAKAMAADGKSANS I DDI TAS LDKLI HEKLGDNFGAS AGI 
GAGGGGGGIGGAGSGSGVGGGLSSDAGAGQSDLMSQVLNGLGKAVLDDLLTPSGEGGT 
TFSSDDMPTLEKVAQETyTODNKAQFPTRDGGSWMNELKEDNGLDAQETAQFRSALDVIG 
QQLGQQQGDASGVTSGGGLGSPVSDSSLGNPAIDANTGPAANGNASVDVGQLIGQLID 
RGLQSVSSGGGLGTPVDNSTQPTGGTPVANPTGNVSNQDLGQLLSGLRQRGLEATLQD 
AGNT GAD LQS S AAQVAAQ L I NAL LQ GT NNQ T NQ AVA 1 1 

gene complement (29337 29678 ) 

/ gene =,, hrpA M 

CDS complement (29337 . .29678) 

/gene="hrpA" 
/codon_start=l 
/evidence=experimental 
/transl table=ll 
/product="HrpA" 
/protein id=" AAB00126 . 1 " 
/db_xref="Gl: 790907" 

/ transl a tion="MVAFAGLTSKLTNLGNSAVGGVGGALQGVNTVASNATLQKNILL 
GTGDSLSVDAQAKASKESDANGAKLIAMQAQETMKKQT^4DVLNAIQAGKEDSTNKKIS 
ATATNAKGI S Y " 

gene complement (29772 31671 ) 

/gene="hrpRS operon" 
gene complement (29772 .. 30680 ) 

/gene="hrpS" 
CDS complement (29772 . .30680) 

/gene="hrpS" 

/f unction="predicted positive transcriptional regulator of 

hrp and avr gene expression" 

/codon__start=l 

/transl table=ll 

/product="HrpS" 

/protein id=" AAC34753 . 1 " 

/db_xref="GI : 3228543" 

/ translation="MSLDERFEDDLDEERVPNLGIVAESISQLGIDVLLSGETGTGKD 
T I ARR I HEMSARKGRLVAMNCAAI PE S LAE S E L FGWS GAYT GADRS R VGY VE AAQGG 
TLYLDEIDSMPLALQAKLLRVLETRALERLGSTSTINLDICVIASAQCSLDDAVERGQ 
FRRDLYFRLNVLTLKLPPLRNQSDRIVPLFTRFTAAAARELGVPVPDVCPLLHKVLLG 
HDWPGNIRELKAAAKRHVLGFPLLGAEPQGEEHLACGLKSQLRVDEKALIQESLKRHD 
NCLDSVSLELDCHARTLYRRIKELQI " 

gene complement ( 30724 .. 31671 ) 

/gene="hrpR" 

CDS complement (30724 . .31671) 

/gene="hrpR" 

/f unction="predicted positive transcriptional regulator of 

hrp and avr gene expression" 

/codon_start=l 

/transl table=ll 

/product="HrpR" 

/protein id=" AAC34752 . 1 " 

/db_xref="GI: 3228542" 

/transl a tion="MSTGIDKDVRECWGVTALSAGHQIAMNSAFLDMDLLLCGETGTG 
KDTLANRIHELSSRSGPFVGMNCAAI PESLAESQLFGWNGAFTGVCRAREGYIEASS 
GGTLYLDEIDSMPLSLQAKLLRVLESRGIERLGSTEFIPVDLRIIASAQRPLDELVEQ 
GLFRRDLFFRLNVLTLHLPALRKRREQILPLFDQFTQGIAAEFGRPAPALDSGRVQLL 
LSHDWPGNIRELKSAAKRFVLGFPLLGADPVEALDPATGLRTQMRIIEKMLIQDALKR 
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HRHNFDAVLQELELPRRTLYHRMKELGVAAPILCEVRGL" 
CDS complement (32753. .34240) 

/note="ORFl" 
/ codon_start=l 
/transl table=ll 
/product= "unknown" 
/protein id=" AAF71498 . 1 " 
/db_xref="GI : 8037791" 

/translation= M MPAVAFPVSSPRLLARAVQIAVLAMGALCVGCQSVDYSPPRQDR 

PPRLVSGLVYWGGDDKEGSRAADKISDLPVYNGEDVWQRVAQRCRLVDGQGMNERIAR 

QRDWLLSNRGFITGASVRASPYLHFIVERLDERNMPLELALLPMIESSYNPMANSPAA 

AAGLWQFIPSTGRSFNLHQSATYDARRDWASSKAAMDYLTRLHDQFNNDWLLALAAY 

NAGEGTVGRAIEANRRRGLPVDYWNLNLPKETQDYVPRLLALSMWRNPNAYGVKLTP 

VANTPY FDWE LNHAVDLTQLAATAGVDE GQLLRLNS AFLRKKT FDGPGRLLI PKTQN 

RVLTTSIARITGESPVTASIPLVTPSAIVKRAAERLPAPAPVAERPVRVAERPPLPKP 

VQSAQLAENLPS PKTAQPLPVAQQPTAPEALQPAPAAESAPAVE I AQSAPWE PDPLT 

NRIQLPRIRDDRSSPRKRDDDEYRSGPRELPTGPRVWFASDPRQ" 

gene 34461 . . 39849 

/ gene="avrE" 

CDS 34461.. 39848 

/gene="avrE" 
/codon_start=l 
/ transl table=ll 
/product="AvrE" 
/protein id=" AAF71499 . 1 " 
/db_xref= M GI: 8037792" 

/translation="MQSPSIHRNTGSIIQPTVTPDARAATDLQERAEQPRQRSSHSLS 
SVGKRALKSVGKLFQKSKAPQQKAATPPTAKNVKTPPPASNVATPRNKARESGFSNSS 
PQNTHRAPKWI LRNHPNQASSSGAQTHE I HPEAAPRKNLRVRFDLPQDRLERS PS YLD 
SDNPMTDEEAVANATRQFRS PDSHLQGSDGTRI SMLATDPDQPSSSGSKI GDSDGPI P 
PRE PMLWR SNGGR FE LKDE KLVRNSE PQGS I QLDAKGKPD FS T FNT PGLAPLLDS I LA 
: TPKQTYLAHQSKDGVHGHQLLQANGHFLHLAQDDSSLAVIRSSNEALLIEGKKPPAVK 
MEREDGNIHIDTASGRKTQELPGKAHIAHITNVLLSHDGERMRVHEDRLYQFDPISTR 
WKIPEGLEDTAFNSLSTGGNGSVYAKSDDAWDLSSPFMPHVEVEDLQSFSVAPDNRA 
ALLSGKTTQAI LLTDMS PVI GGLTPKKTKGLE LDGGKAQAAAVGLSGDKLFIADTQGR 
LYSADRSAFEGDDPKLKLMPEQANFQLEGVPLGGHNRVTGFINGDDGGVHALIKNRQG 
ETHSHALDEQSSKLQSGWNLTNALVLNNNRGLTMPPPPTAADRLNLDRAGLVGLSEGR 
I QRWDAT PECWKDAGI KDI DRLQRGADSNAY VLKGGKLHALKI AAEHPNMAFDRNTAL 
AQT AR S TKVEMGKE I E GLDDRV I KAFAMVSNKR FVALDDQNKLT AHS KDHKPVT LDI P 
GLEGDIKSLSLDEKHNLHALTSTGGLYCLPKEAWQSTKLGDQLRARWTPVALPGGQPV 
KALFTNDDNVLSAQIEDAEGKGLMQLKAGQWQRFEQRPVEENGLNDVHSRITGSNKTW 
RIPKTGLT LRMDVNT FGRS GVE KS KKAS T S E F I RAN I Y KNTAE T PRWMKNVGDH I QHR 
YQGRLGLKE VYETESMLFKQLE LI HE S GGR PPARGQDLKARI TALEAKLGPQGATLVK 
ELETLRDELENHS YTALMS I GQSYGKAKNLKQQDGI LNQHGELAKPSVRMQFGKKLAD 
LGTKLNFKSSGHDLVKELQDALTQVAPSAENPTKKLLGTLKHQGLKLSHQKADIPLGQ 
RRDASEDHGLSKARLALDLVTLKSLGALLDQVEQLPPQSDIEPLQKKLATLRDVTYGE 
NPVKWTDMGFTDNKALESGYESVKTFLKSFKKADHAVSVNMRAATGSKDQAELAGKF 
KSMLKQLEHGDDEVGLQRS YGVNLTTPFI I LADKATGLWPTAGATGNRNYI LNAERCE 
GGVTLYLISEGAGNVSGGFGAGKDYWPGFFD ANN PARS VDVGNNRTLTPNFRLGVDVT 
ATVAASQRAGWFNVPDEDIDAFVDDLFEGQLNPLQVLKKAVDHESYEARRFNFDLTA 
GGTADIRAGINLTEDRDPNADPNSDSFSAWRGGFAANITVNLMTYTDYSLTQKNDKT 
ELKEGGKNRPRFLNNVTAGGQLRAQIGGSHTAPTGTPASAPGPTPASQTAANNLGGAL 
NFSVENRTVKRIKFRYNVAKPITTEGLSKLSKGLGEAFLDNTTKAKLAELADPLNARY 
TGKKPDEVIQAQLDGLEELFADIPPPKDNDKQYKALRDLKRAAVEHRASANKHSVMDN 
ARFETSKTNLSGLSSESILTKIMSSVRDASAPGNATRVAEFMRQDPKLRAMLKEMEGS 
IGTLARVRLEPKDSLVDKIDEGSLNGTMTQSDLSSMLEDRNEMRIKRLWFHTATQAE 
NFTSPTPLVSYNSGANVSVTKTLGRINFVYGADQDKPIGYTFDGELSRPSASLKEAAG 

DLKKEGFELKS " 
CDS complement (39897 . .40292) 

/note="ORF2" 
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/ codon_start=l 
/transl table=ll 
/product= "unknown" 
/protein id= n AAF71500 . 1 " 
/db_xref="GI : 8037793" 

/t ran slation="MTMKTSQPD FAR FINS LGAQLGTSLTLQNGVCALYDGQNNEAAI 
IELPEHSEMVIFHCRIGRCPERAPDLLRLLSLNFDVARLHGCWFAVDQGDVRLCAQRE 
LAS LDE PAFCDVTRGFI SQAREARAFLQA" 
CDS complement (40631. .42502) 

/note="ORF3" 
/codon_start=l 
/transl table=ll 
/product= "unknown" 
/protein id=" AAF71501 . 1 " 
/db_xref="GI : 8037794" 

/translation="MISSRIGGAGGVKLSRVNQQHDTVPAQTAHPNAVTAGMNPPLTP 

DQSGSHATESSSAGAARLNVAARHTQLLQAJTKAEHGTAPVSGAPMISSRAALLIGSLL 

QAEPLPFEVMAEKLSPERYQLKQFQGSDLQQRLEKFAQPGQIPDKAEVGQLIKGFAQS 

VADQLE H FQLMHDAS PATVGQHAKADKAT LAVSQTALGE Y AGRASKAI GE GLSNS I AS 

LDEHISALDLTLQDAEQGNKESLHADRQALVDAKTTLVGLHADFVKSPEAKRLASVAA 

HTQLD^^WSDLVTARNTVGGWKGAGPIVAAAVPQFLSSMTHLGYVRLSTSDKLRDTIP 

ETSSDANMLKAS 1 1 G1WAGIAHETWS WKPMFQAA^ 

VIPDPFELKSEHGELVKKTPEEVAQDKAFVKSERALLNQKKVQGSSTHPVGELMAYSA 
FGGSQAVRQMLNDVHQINGQTLSARALASGFGGAVSASSQTLLQLKSNYVDPQGRKIP 
VFTPDRAESDLKKDLLKGMDLRE PS VRTTFYSKALSGIQS SALTS ALPPVTAQAEGAS 
GTLSAGAILRNMALAATGSVSYLSTLYTNQSVTAEAKALKAAGMGGATPMLDRTETL" 
CDS complement (42499. .42993) 

/note="ORF4" 
/codon_start=l 
/transl_table=ll 
/product^ "unknown" 
/protein id=" AAF71502 . 1 " 
/db_xref="GI : 8037795" 

/translation="MTNNDQYHTLINEICALSLISTPERFYESANFKISEVDFTLQFQ 
D R D E GRAV L I Y GDMGAL PAR GR E S AL LALMD INF HM F AGAH S PA F S FN AQT GR V L LMG 
SVALERASAEGVLLLMKSFSDLAKEWREHGFMGQATTAGSSTDQPVAPA7UCRESLSAP 

GRFQ" 

gene 43299. . 44573 

/gene="hrpW" 
CDS 43299. . 44573 

/gene="hrpW" 

/codon_start=l 

/transl table=ll 

/product="HrpW" 

/protein id=" AAF71503 . 1 " 

/db_xref ="GI : 8037796" 

/translation="MSIGITPRPQQTTTPLDFSALSGKSPQPNTFGEQNTQQAIDPSA 
LLFGSDTQKDVNFGTPDSTVQNPQDASKPNDSQSNIAKLISALIMSLLQMLTNSNKKQ 
DTNQEQPDSQAPFQNNGGLGT PS ADS GGGGT PDATGGGGGDT PS ATGGGGGDT PTATG 
GGGS GGGGT PT AT GGGS GGT PT AT GGGE GGVT PQIT PQ LAN PNRT S GT GS VS DTAGST 
EQAGKINWKDTIKVGAGEVFDGHGATFTADKSMGNGDQGENQKPMFELAEGATLKNV 
NLGENEVDGIHVK7\KNAQEVTIDNVHAQNVGEDLITVKGEGGAAVTNLNIKNSS7\KGA 
DDKWQLNANTHLKIDNFKADDFGTMVRTNGGKQFDDMSIELNGIEANHGKFALVKSD 
SDDLKLATGNIAMTDVKHAYDKTQASTQHTEL" 
CDS complement (44657 . .46117) 

/note="ORF5" 
/codon_start=l 
/transl table=ll 
/product=" unknown 11 
/protein id=" AAF71504 . 1 " 



http://www.ncbi.nlm. nih.gov/entrez/query .fcgi?cmd=Retrieve&db-nucleotide&list_uids=1332... 2/4/03 



A 



/db_xref="GI : 8037797" 

/translation="MHINRRVQQPPVTATDSFRTASDASLASSSVRSVSSDQQREINA 
IADYLTDHVFAAHKLPPADSADGQAAVDVHNAQITALIETRASRLHFEGETPATIADT 
FAKAEKLDRLATTTSGALRATPFAMASLLQYMQPAINKGDWLPAPLKPLTPLISGALS 
GAMDQVGTKMMDRATGDLHYLSASPDRLHDAMAASVKRHSPSLARQVLDTGVAV^^ 
ARNAVR T V LAP ALAS R P AVQGAVD L GV S MAGGLAAN AG F GNR LLSVQSRDHQR GGAL V 
L GLKDKE PKAQ L S E E NDWLE AY KAI K S AS Y S GAALNAGKRMAGL P LDMAT DAMGAVRS 
L V S AS S LT QN GLALAGG F AGV GK LQ E MAT KN I T D PAT KAAV S Q LT NLAGS AAV F AGWT 
TAALTTDPAVKKAESFIQDTVKSTASSTTGYVADQTVKLAKTVKDMGGEAITHTGASL 
RNTVNNLRQRPAREADIEEGGTAASPSEIPFRPMRS" 
CDS 46489.. 47562 

/note="ORF6" 
/codon_start=l 
/transl_table=ll 
/p roduct= " unknown " 
/protein id=" AAF71505 . 1 " 
/db_xref="GI : 8037798" 

/translation="MSGPFEKKWRCFTRTVTYVGWSLFWLLLWDVAVTVDVMLIEGKG 
IDFPLMPLTLLCSALIVLISFRNSSAYNRWWEARTLWGAMVNTSRSFGRQVLTLIDGE 
RDDLNNPVKAILFQRHVAYLRALRAHLKGDVKTAKLDGLLSPDEIQRASQSNNFPNDI 
LNGSAAVISQAFAAGQFDSIRLTRLESTMVDLSNCQGGMERIANTPLPYPYVYFPRLF 
STLFCILMPLSMVTTLGWFTPAISTWGCMLLAMDRIGTDLQAPFGNSQHRIRMEDLC 
NTIEKNLQSMFSSPERQPLLADLKSPVPWRVAN7^SIGGLSRQKNRLGEGARLIASESL 
L W AP FR S VAD V AP C HAS AY LR RA " 
CDS complement (47471. . 48523) 

/note="ORF7" 
/codon_start=l 
/ transl_table=ll 
/product= "unknown" 
/protein id=" AAF71506 . 1 " 
/db_xref="GI : 8037799" 

/ translation="MYIQQSGAQSGVAAKTQHDKPSSLSGLAPGSSDAFARFHPEKAG 
AFVPLEGHEEVFFDARSSFSSVDAADLPSPEQVQPQLHSLRTLLPDLMVSIASLRDGA 
TQYIKTRIKAMADNSIGATANIEAKRKIAQEHGCQLVHPFHQSKFLFEKTIDDRAFAA 
D Y GRAGGDGHAC LGLS VNWCQS RAKGQSDEAF FHKLE D YQGDALL PRVMGFQH I EQQA 
YSNKLQNAAPMLLDTLPKLGMTLGKGLGRAQHAHYAVALENLDRDLKAVLQPGKDQML 
LFLSDSHAMALHQDSQGCLHFFDPLFGWQADSFSNMSHFLADVFKRDVGTHWRGTEQ 
RLQLSEMVPRADFHLR " 
CDS complement (48537 . . 49016) 

/note="ORF8" 
/codon_start==l 
/transl table=ll 
/product=" unknown" 
/protein id=" AAF71507 . 1 " 
/db_xref="GI : 8037800" 

/ translation="MRPVEAKDRLYQWLRNRGIDAQEGQRHNVRTANGSECLLWLPEQ 
DTSLFIFTQIERLTMPQDWILILAMALNLEPARTGGATUiGYNPDSRELLLRSVHSMA 
DLDETGLDHLMTRISTLAVSLQRYLEDYRRQEQAGKTAQKEPRFLPAVHLTPRTFMT" 
CDS 49233.. 49520 

/note="ORF9" 
/codon_start=l 
/transl table=ll 
/product= "unknown" 
/protein id=" AAF71508 . 1 " 
/db_xref="GI : 8037801" 

/ trans la tion="MLKKCLLLVI SMS LGGCWSLMIHLDGERCIYPGTRQGWAWGTHN 
GGQSWPILIDVPFSLALDTLLLPYDLTAFLPENLGGDDRKCQFSGGLNVLG" 
CDS complement (49583. .50029) 

/note="ORF10" 
/codon start=l 
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gene 



CDS 



CDS 



/transl table=ll 
/product= "unknown" 
/protein id=" AAF71509 . 1 " 
/db_xref="GI : 8037802" 

/translation="MKQVEVQIITELPCQVLILEQEAVAEGFRFLTRLIEEWRSGKNR 

FEAKGECLMWLLDGALAGIGGLSRDPHARGDMGRLRRLYVASASRGQGLGKTLVNRL 

VEHAAQEFFAVRLFTDTPSGAKFYLRCGFQAVDEVHATHIKLLRRV" 

complement (50437. .51063) 

/gene="gstA" 

complement (50437 . .51063) 

/gene="gstA" 

/codon_start=l 

/transl table=ll 

/product="GstA" 

/protein id=" AAF71510 . 1 " 

/db_xref="GI : 8037803" 

/translation="MIIVHHLNNSRSQRILWLLEELGLPYEIKRYQRDPKTNLAPPEL 

KAIHPLGKSPVIEDGPHWIESGAIVDYLIRRHGDGRLQPDPASATYDEYVQWLHFAE 

GSAILPLMLNLYVGRLGDAGAPLHPRIHSELANYLGYLNEALGLTPYLLGDELSGADI 

QMS F I GE FAKMQ GMLQ PH PNLAAWVQRCQE R PAY RQALEQ GGE Y S FAK " 

complement (51153 . .51569) 

/note= ,, ORFll" 

/codon_start=l 

/transl_table-ll 

/product= "unknown" 

/protein id=" AAF71511 . 1 " 

/db_xref="GI : 8037804" 

/translation="MNLTDHYRAYIACLNDRSWHQLGNFVDDNVHYNGELISLSGYKT 
MLENDVRIIPDLRFKIDLLWEAPQVAARLVFNCSPQGRFLGQDTQGRRITFSENVFY 
T FSDGKI I GVWS VI DKAAVERQLRGE SAALCADRRV " 



BASE COUNT 
ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 



11444 a 15435 c 14828 g 10788 t 



ggatccagcg 
tcgatgaaca 
acgccacggc 
ggtttgtctg 
ggctcgccca 
tcgagcgaac 
gcagtgtttt 
gagacacgtg 
acgtccgaac 
gcaccatcga 
aggctgaaca 
tcgtgcaggt 
gtgttgccca 
accgtgccgt 
gtcaggcgac 
gtgcgactca 
gggtgatgaa 
cgtaggccgc 
attcaggcaa 
agatgaagat 
ccagcgaacg 
ccgccacggc 
cttcgatctg 
aagcagtctc 
cggcagtcgg 
ggtcggcacc 
ccagcaacgg 
ccatctcggc 



gcgtattgtc 
ggtggccgtt 
gcacaccttc 
ctggcataag 
ccgacagacc 
gcttgcgcag 
cgccgtgcgc 
ctacgtcttc 
ccaggtcacg 
ccggagaggc 
cctgaaaacc 
cgccgtggcc 
gaatcatctg 
aggtgcccac 
cgcgacgggc 
tgcgtgatcc 
catggcatca 
catggtttcg 
atgaaaatta 
gtcggtgtcg 
cacgctggtg 
atcgaccacg 
ctcgacacgc 
gacgcccttg 
cgccgccaca 
ttcgtccggg 
cagcacttct 
ctcgccgccg 



gtggcgatgg 
gcgggcgttg 
gaccagatct 
gcccggcagg 
gccaatcgcc 
gttctcgtgc 
gaccttggag 
gtcggccggg 
ctggacctgc 
gaaggtcacg 
gccagagtcg 
cttgatgacc 
cgcaccggtg 
cggcatgaac 
cttgccgtcg 
tctggtgccg 
ccgtaactga 
ggataaccgg 
gtcaccaggg 
ccgctaaacg 
gtcccgaccg 
tcctggctga 
accggctgga 
gcggcaattg 
gcaccggcgc 
cggtctatat 
tcggcaaagc 
ccatcgatca 



aacgcgttac 
cgggtcggca 
tcgggcttgc 
taatccagca 
aggccgtcaa 
atgccaccct 
cgcttggccc 
tacggcgtgc 
atcgactctt 
ccctcctcct 
gtcagaatcg 
tcggtgcccg 
gcctcgatat 
gccggggtct 
gtggccaaca 
attcctgtgg 
agaagcggta 
cgaacgccga 
catcgaccac 
gcttcaactg 
caatcacccg 
cttccagcca 
acgtacccgc 
cttccatcaa 
gctgggcgta 
aaggaggcaa 
gcaactcgaa 
ggatcgacga 



3 others 

ggattttcag 
tgacacaatc 
ctacacccat 
ccttgatcat 
agccgatctc 
gaacaatgcc 
agcgcaacga 
actcatcgaa 
ccgggcccat 
tgatcttgcg 
gccctttcca 
gacgcagcca 
cacgcggcaa 
cgaccacgcc 
actcgaaaga 
ggccgtcggc 
cccgtgttcg 
aaccagcatc 
atgaaacggc 
gccatcacgc 
cccgccccgc 
ttcgctgtgc 
gccgacgtgc 
cggctggtcg 
aacggtctga 
cggcatatgg 
cagcgcgtca 
gcccggcttt 



cacaccggta 
gaacatatca 
caagtaacga 
ctcgtgcttg 
atccaggcct 
gaacagcgcg 
cagctccatg 
aatcatcacg 
gaacaccttg 
catggcgccc 
ctgcatgaaa 
caagtggaag 
catgcccttg 
acgcggaaag 
catacgacag 
gcgggattgc 
atggccgccg 
aacagcgtgg 
cgccccggat 
gcggcactct 
gcacggcacg 
atgtggtgat 
agagtgacaa 
aaatgcaggc 
taacgctcgc 
ccgacacgat 
tgccgcgcca 
ggcgacttgc 
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1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 
4201 
4261 
4321 
4381 
4441 
4501 
4561 
4621 
4681 
4741 
4801 
4861 
4921 
4981 
5041 
5101 
5161 
5221 



tggcacgcac 
gcttgccgcc 
acaccatcaa 
gcgcgcccgt 
gacgagcaat 
tcgtttagca 
atccctgttg 
ggaattggta 
ccctcggggc 
tggaaagtgg 
gcccaggact 
ctatgaacaa 
ttaaagtacc 
aagagaccca 
cgaaagacgt 
tattgcctat 
acaatagaac 
tcaccgaagg 
tgacctcgtg 
ggcaggcgta 
cacgaaataa 
gttgtcggtg 
tttgttgtcg 
gttgtccagg 
atccttgtcg 
gttgccgcca 
aaatgcaggt 
cgtcaggttt 
ggttactgaa 
ccgacagagg 
tcacacgact 
cagtgtcgga 
tttgttgcgt 
atgcctccgt 
aaggctacgc 
atcgagcaga 
caacaatccc 
tttcttttcg 
cgacgtgtgt 
cgcaggtggc 
tcaaccacag 
ctgacactta 
gttccgtaag 
ttgtaacgaa 
gcttcgacgt 
tcaggtacat 
cctgatacaa 
agcacagcat 
atctctgcct 
gccaacattg 
ccaaagcgtc 
cgagatgccg 
gtcatcggct 
gtagtgccca 
agcgatacgg 
gtgataaagg 
ggtctatacc 
aaaaagacct 
gcgatcttac 
aatgcttgtt 



gtgcgccagc 
ggacgccttc 
gtcgcccgag 
cggcccatca 
cagggaatcg 
gggccgggaa 
accaacggaa 
gacgcggcgg 
accaccattg 
tctgactgag 
gccttccagc 
gatcgtctac 
tacaggcgaa 
gtggcagcaa 
cgaagacgca 
attgtccggg 
tgaactaagc 
cgtgacgctg 
ccacggacgc 
ctaacgtgca 
cacggtaggt 
ttgttgtcgt 
ttgtcgagat 
tccttgtcgt 
ttgccgccaa 
cacgtggcac 
agcgaagtgc 
ttatacgcgc 
cacgttcgat 
tcgaccaaac 
ctcctaccga 
tggtttgacc 
ggcatgctaa 
caaatagtgg 
acgaggacat 
acgcccccat 
tggcttttcc 
tgaagatgca 
cacatccagc 
tcaccacctg 
gcaaccctgg 
ccgcaccggg 
cccaatccgt 
cctgaacgag 
aatccagata 
tcgctgagcc 
ggtcgatcag 
ccagtttttt 
gggcaccctg 
caaaggctaa 
gtcggacctg 
cattggttag 
gggagcatca 
gagtgcagct 
attcgtttgc 
cctgatgcct 
ttttgcaagg 
tgagtttcaa 
cctcctctac 
tcgttatggg 



acacgatggc 
tgcccgaaca 
cgcaaatgct 
agggtcaaca 
gggagttcga 
gtttatccgg 
aactcatcct 
attcaaaatc 
agaaaagacc 
gctgcgatct 
gcagagcgtc 
gtaaaagctt 
attaaaaagg 
accgggtggt 
gtggcgcaac 
gcttatgatt 
ccaggagacc 
gtggcgaaaa 
cgctctgccc 
caagacctgc 
cgcgttgcta 
tatcaagatc 
ctttgtcgtt 
tacccccaaa 
atgccgcgtc 
cggtgctgtt 
caatgatcgt 
gcatcaggtt 
cagtgactaa 
tgcagcctgt 
tgctgggagt 
ggttttgggg 
tcgatacatt 
acgccagtca 
tgctgagatt 
gccagccacc 
gatacatagt 
tttcgcaaga 
ccgggaagcg 
actgtcgaca 
cagatagact 
gcttatctgc 
gaaaaagtgc 
attcctcaca 
agcaaaacaa 
caccaacatg 
ctgaccttta 
tgaggtgtag 
aatatcactt 
agcccatagg 
attgtggctc 
ctcaatcacg 
gttggcaatg 
gaccagcgtg 
ggcaggggcc 
cagtacgcca 
ttaacgaact 
ggtctttttt 
tcgggttggc 
catggcgtga 



tgtccagcac 
aacgtgcggg 
cgagcaaatc 
gacgactgct 
aggtaaagtc 
tttgacggca 
tatacttcgc 
cgttttcgaa 
ttgaaattca 
accccacctg 
ggtacccgga 
acttcaaacc 
gctttttcgg 
ctgattgtca 
tcaatgctga 
atgcgctcaa 
agtcctatgt 
aatttcagtc 
cctgatacga 
ccgtatcagc 
ctttttagcg 
gcggtcattt 
accgccaaac 
tgccgcgtcg 
agtcacgttg 
gtcgttgtcc 
cagcgcaagc 
ttcccggata 
aacagtatgt 
ttcataccca 
accaaaaaac 
agaattgctc 
tatcagtgtg 
cgttgcataa 
cggctgggca 
cgttaactca 
ccagaaaagg 
cagggccttt 
ggggtgtaaa 
aggcggctcg 
ttgcctttgg 
gcggtaatgt 
ttgcgattca 
aaatcctgct 
tccagacctc 
tttgagcggt 
ttcatataac 
gcatccagat 
cccggcgccg 
gtcgtctttt 
gcgatacgcg 
gcgcactatt 
cattcgcggt 
ccgccatcga 
atgcccgcta 
cctggcttac 
gtcatcaaaa 
cgtttggtga 
cgttagcacc 
tacaagcggt 



gcgctcgacc 
aatgacacgg 
ggtgaattga 
gcgacgctcg 
agcgacgcgc 
ttagtaaaaa 
cgccattgag 
agaagtggga 
aggtcttttt 
cccggaattg 
tcacacgacc 
cattggggag 
cgacaaggaa 
gatagacggt 
cggttatgag 
ataccgatac 
cttcggctat 
gtctgcaagc 
aaacgccttc 
aagcgcaaga 
gcagacggcg 
ccaccgaaag 
gctgcatccg 
gtgtggtggt 
tcgttatcca 
agatcacaat 
agaaagccgc 
agtgaaaatg 
aactgcagcc 
tcaatttcta 
ttccgcactg 
aaacggagaa 
tgatgcggta 
aacctgacgt 
ttttcgctgt 
attgtctttt 
caaatccatc 
atccgtcacg 
tgccaatgta 
ggatatacgt 
ccctttcatt 
catccgccac 
aaaagtcaac 
gcgatgttga 
tgaagtcgat 
acggtgttcc 
ttttgttggt 
ttagtttaac 
gccccgaaac 
gcatctgatt 
agcaggctgc 
taccacgtgt 
ctcggcctca 
ggccgccgca 
ttgaatcggc 
aggcgggttg 
aacatggaag 
aaagtgatct 
caaagctacc 
aggcgtacag 



agaatctcca 
gtattgttga 
cgatgtgcca 
gccaacgggt 
atgatcgggt 
acctgcgtaa 
ccctgatggc 
gttcgattct 
tttcgtctgg 
gccgcggagc 
aaggataacg 
gaagtctcgg 
atcatgaaaa 
gaacggctat 
attcaaacgg 
gaaatacgtc 
ggctacagct 
tgaataatag 
ctcaacaaga 
cgctcgcctc 
tgccgttgta 
ccgcatcggt 
tatggtgatc 
cattgtccat 
gatccttgtc 
cgtttacggc 
cgatctttgc 
atgaagcaag 
ttctgcaaga 
tagcgaccgt 
catttttttg 
cgatgagttt 
tggcagcttc 
cactccaaaa 
ttacacaggg 
gccctgaaaa 
acctttctgt 
ataaagaaac 
atcaccggtg 
catgctacgc 
aaggcgtttt 
agggtatgcc 
atcgccaccc 
tcttcgaaac 
gactaattgt 
taaaaacgct 
gcgggcttcc 
gggtgttttc 
cccacaccct 
caccgtaatt 
tccattcctt 
catcggttgc 
gcagacgctg 
gaggccgccc 
tgactggccc 
cattgcaata 
cacaatcaga 
gactcaaccc 
ttcctgcgcg 
caggtccatg 



http://www.ncbi. nlm.nih.g0v/entre2yquery. fcgi?cmd=Reta^ 2/4/03 



5281 
5341 
5401 
5461 
5521 
5581 
5641 
5701 
5761 
5821 
5881 
5941 
6001 
6061 
6121 
6181 
6241 
6301 
6361 
6421 
6481 
6541 
6601 
6661 
6721 
6781 
6841 
6901 
6961 
7021 
7081 
7141 
7201 
7261 
7321 
7381 
7441 
7501 
7561 
7621 
7681 
7741 
7801 
7861 
7921 
7981 
8041 
8101 
8161 
8221 
8281 
8341 
8401 
8461 
8521 
8581 
8641 
8701 
8761 
8821 



agtctcggga 
tcaaggcaac 
ccaaaggcat 
agcagatatc 
atgagtcacg 
tacgtccggc 
attacagcga 
cagggttgag 
tggaaaagca 
gaggtttgat 
cgaccaagaa 
agtatttctt 
gcctggtggg 
gtgaggaaac 
atcaggatcg 
gcgagaaaag 
gggcgaaggt 
ggtttcgcgg 
tttggatggt 
cgccttggaa 
ctatctgatc 
tacttcagac 
gctgtttcca 
ccttatagga 
cgacggtcgc 
actctgccaa 
cccacttacc 
gggcacgctt 
caacgcgcaa 
ggagctcttc 
caggcatcgg 
cggcgcagag 
tattgatcag 
gcgcatcacg 
acggcgccag 
caatcagttt 
tgtccatttc 
cgtcggccag 
gcaacaggat 
gtgccagcgc 
ccttgaagtt 
tctggccgcc 
ttacgccatc 
tcatatccgg 
gcgcaacgct 
cagcttgcgc 
caccgacggg 
acgctgcccg 
aagcaccgga 
aatgatcatg 
tggaatcgcg 
gccgacggcg 
gcccatcatc 
gctgtcgtaa 
tgaagcaaag 
agtcaacatg 
ggagtcttgc 
atcagggggc 
gctggcatca 
ggtcacgtag 



acctgattga 
gcttccctga 
ttgcagagag 
tttaagtttc 
cttatgtgtg 
ctatccgctg 
tccggcgatg 
tctggatcga 
tgagttggca 
gctgcgccaa 
caaggacggc 
cggcatgaaa 
tactgcggcg 
ctatgtcagc 
ccagatgatc 
tttgatcgca 
tgaacatccg 
gctggctaaa 
gcgaaaacgg 
aggtgctgtt 
gggcgagttt 
cttccttaac 
gtaaactgac 
aagcctcacg 
tcgggacttg 
ccctgggact 
ttccggtttg 
tatctcagct 
cccggcgtgc 
cggggcctga 
actgcctttg 
gtgctccata 
ttcacgcttg 
taacagctcg 
ctcaggtaac 
cttgtcatta 
aaggtaattg 
cttgagacct 
gctcacaaca 
ataggggtgc 
cgtatcctgc 
gtgcacatga 
ctttccgcca 
atagcgcccg 
ggctgctgga 
agtgggacgc 
tctcatgatt 
ggccccgaca 
tcaacgttat 
gaaataccga 
gcgcctaggg 
cgaccgatca 
ttcgccccgg 
atggcctggg 
cccacgatcg 
gacggtccgc 
agcgtaccca 
agcacgctgt 
gactttttct 
gacgtggcaa 



gagccgctct 
ccttgagcac 
aggacagcaa 
ataacaacca 
gcgactcatc 
atggcgatgc 
gaggaagcac 
atcgccgatg 
ggcgggattt 
ggtatggtgg 
aaacgcgatc 
gcgcatatcg 
aatgtggcgg 
ggtgatgcgg 
tggtcaattg 
cgggtctatc 
cttcgcgtga 
aacaccgcgc 
ctgctggcga 
tgaaggaaaa 
ttttgaacct 
ctcagtagcg 
aaatttcatg 
tctggattca 
aggcccagat 
ccgtcacagg 
ctcagcctta 
ttctccgggg 
ttggccgctt 
tgaccctgac 
ggcgcgctca 
aaaagcgtgg 
ctggacgttg 
cgcatgacac 
ttttgcgcgt 
ctgtcaggtg 
agatttatat 
tgagcctcaa 
tttgcagaca 
gggctctcac 
aatgctttgc 
ctgctgcctg 
ggctcctgcc 
gctggaaagc 
gaggaactgg 
tcagcttcat 
gaatctccgc 
tttcagtcaa 
ggtcgaacgc 
aacctaccgc 
cggcacctgc 
tctgtccgcc 
cgtcgatgcc 
ttgcgccaag 
agttgagcgc 
cctcatcgct 
acgtgctgtt 
cgtaaatggc 
ggccaagcag 
tcttgtcttg 



gcgctgtacc 
cacttagctg 
agctggccaa 
cctttgttga 
gaaatcggtt 
tgcggattca 
tttacgagac 
aaaccacgat 
tgcaggtcat 
tcgatgcgac 
ccgaaatgca 
gcgtcgatgc 
acgtgactca 
gctacaccgg 
cggcacgccc 
gcaaaatcga 
tcaagcgcca 
aacaggctac 
tgggagaggt 
tcgatgagtt 
caggccatga 
aggccggata 
cactgccgcc 
gcttgccgcc 
catcgatcac 
aagtggcgtg 
ttttttctgc 
ccatataggt 
ccaccaaggt 
tctgcaaatc 
ggtaattatt 
tcacgccact 
aattgtgacc 
cgagactgtc 
tgaaatcatc 
cccataaacc 
gaaggtggtc 
ggcgctgttc 
gttggctgct 
accagcgcgc 
tttgagctga 
ctgcgtccgg 
cctcgatttt 
gcttcaggtc 
cctgtgaaga 
aggttggcgg 
gtacgaaaaa 
tcaatgcgcc 
cttctgcgcc 
cagggcgcca 
ggcaaggccg 
cagacgccct 
acctttaatg 
cttgtcgcca 
cttgccgccg 
tttgccttcc 
catgtagttt 
tttctggtta 
ctgcttcagt 
cagatcgggt 



cccctggcct 
ggcgccacca 
tgcaatgaat 
tcagaattgt 
ccaatgcaag 
cctgatgcag 
aacgatcctg 
tctcaatttc 
caatggctat 
gatcattcat 
tcagacgaag 
cgagtcgggt 
ggtcgatcaa 
cgtggacaag 
aagccgttat 
gttcacgaaa 
gtttggttat 
tctgtttgcc 
gcgcctgtaa 
aacagcgcaa 
aggcatcaaa 
aacgagtccc 
cgcgtgttca 
gtagtttttc 
cagactgcgt 
cgttgccccg 
tgcgtagtaa 
ggacgttgta 
ggtgaagtta 
gaggtagttt 
gagcgccttg 
ggccttcaag 
ctcaccaata 
cttgcttttc 
aaaataacgc 
cttggacgtc 
ccgaccttcc 
aagggcgtgc 
tttccccgct 
gagctcggca 
agccgaggtc 
cttacgcctt 
cagccggata 
ccccagcatt 
tcgggcgcga 
ataatagcct 
tagtgccgag 
ttcgcaatcc 
ttatgctttt 
tcgattgccc 
gtggcttcac 
aggccggctg 
gcctcggcgc 
tgagcgatca 
acgcccgcct 
agaagcttgc 
tcatgctgat 
tcggcggttt 
gcaccgcctt 
ttgttttcaa 



gagccactgt 
tcggcatgca 
tttgttttag 
tgaagaaatc 
atgggatttt 
aactggtttg 
cgccagttcg 
cggcgcctgc 
ctgggtgatc 
gcgccgagct 
aaaggaaacc 
ttagtccata 
ctgctgcaca 
cgtgcggagc 
aaaaagcatg 
gcccagttgc 
acgaaagtcc 
ttgtcgaacc 
tgcggaaaaa 
aaacgtctga 
aatcgatgct 
tttctatgat 
agcgctcaga 
acattgatat 
accccatgca 
acaaaagcga 
ttcatggctt 
tccagcgaga 
tatttcgtgt 
ttcagcctgg 
tcatgtgact 
ctcttcatgt 
acaagccccg 
atcttcgtca 
gctgccttgg 
cccagacaac 
gagacaacaa 
ttgccttctt 
gcttttgagg 
agatcgctcg 
gaggccacgc 
ctggtgtgct 
ttttctacct 
ggagtctctg 
tcgtttcctg 
ggagccggtc 
cccgggcgtg 
cgaactgatc 
tcacagcatc 
agccgaccac 
cggcaaccat 
aggcttcgcg 
ccatcctcgt 
ggctggacac 
cggcgagctg 
ggcctttttt 
tttcggtgaa 
gcagagactg 
cgctgaagtt 
gcacctgatt 



http://www.ncbi.nlm. nih.gov/entrez/query.fcgi?cmd=^ 2/4/03 



8881 
8941 
9001 
9061 
9121 
9181 
9241 
9301 
9361 
9421 
9481 
9541 
9601 
9661 
9721 
9781 
9841 
9901 
9961 
10021 
10081 
10141 
10201 
10261 
10321 
10381 
10441 
10501 
10561 
10621 
10681 
10741 
10801 
10861 
10921 
10981 
11041 
11101 
11161 
11221 
11281 
11341 
11401 
11461 
11521 
11581 
11641 
11701 
11761 
11821 
11881 
11941 
12001 
12061 
12121 
12181 
12241 
12301 
12361 
12421 



ggtagtgggt 
accgctgatg 
cttgtccata 
tgccttttgc 
attgaggtac 
ctgctcggtc 
ctgcaacttg 
tacgtccttg 
gctgagcatg 
cagatttttc 
gcccgcttcg 
gagcgctccg 
attaccgtca 

cgccagtggc 
gtcagcgtcg 
cttttccata 
gccaccaccc 
catggccttg 
attgagctct 
ccaacgtttc 
ggctttgggg 
aaccggagcg 
gggttgcggc 
gagcaaaaga 
gttgccggcc 
caaggctttg 
cacgataacg 
tggcaactgt 
caggcgaacg 
acggtgccaa 
agcgacactt 
cgcgccagtt 
gaatgcactt 
ttgcggatca 
aacttgtgct 
tcagggttca 
atttggcgaa 
atcttctgaa 
tgccagctta 
ctttgtaata 
catcggtcag 
cggcttagaa 
cgccgacccg 
cagggacttc 
tcctcattca 
tcggtgagct 
gggtattgcg 
attacaccga 
ccgaagccga 
gtcgcaagta 
caaatatcga 
gccagccaaa 
aattcaacat 
cgccgtcagc 
acgtgaccac 
ccgacctgac 
gaatgaatgt 
accaattggc 
ggcgcgacat 
tcaccactca 



actttggaat 
gcgccgctgt 
tcagtctgca 
agcgaggcat 
gcttgtacgt 
ttgtcggtgt 
accatgacgg 
tccagcttgc 
ctggcgaact 
gcgtcgaaca 
tctacctgac 
gacaaccctg 
actttggtct 
atattggctc 
gggttgtcct 
ttttttgcga 
tgagcaacgt 
ccggcatcgg 
ttcgccgcct 
agagcttctg 
gcgttggaag 
gccggggcag 
tggacctgat 
gccaggatag 
ggctgaccga 
gcaagaggcg 
aagggagaac 
gtggttgaag 
ggtcgatttg 
tcggcacgcc 
ccagcacttt 
gccgatgccc 
cgtcttccca 
ggttcagcgc 
cgttacgcag 
tcacccgctt 
taccggcgga 
acatgggctt 
aaaatagaaa 
caaaactttg 
cttcgcaagg 
ctgccctatg 
cgtcgtgacg 
gccacaacag 
gcgcgacctg 
ttatcaactg 
caaaaagctc 
cggtgatgcc 
cggtgagatg 
cggtccacgt 
taacaagcgt 
cattaccggc 
tgacctgcgc 
ttctcacgag 
gctgctgaaa 
ggtcgacccg 
gaaccagacc 
gttcctcaac 
gaaaagccgc 
ggaacaaaag 



cggggaacag 
aatcggcatt 
gcgcttgacc 
cactgcggac 
caggatcagc 
tgcgcaggct 
ccgctttctg 
tgatatcaat 
ggccgccgtt 
tcttatcagg 
ccaaaaagcc 
ggttactttg 
tgtccgctgc 
gcatcactgc 
tggtgtagtt 
aggtcttgag 
ccacggcggt 
ggccatcatc 
gctcgcgctc 
gcgaggagta 
cgtcggttgc 
tcgcttcagt 
tattcacatt 
acgacgcggt 
actgcacgcc 
actcaacagc 
tggatatacg 
gtgcaagttg 
ctgcttgagc 
gaggctgttc 
ttgcatgttc 
ctctacctga 
gctttcctga 
gatgccacac 
ggcttcaaga 
ttggataaac 
cgaagatggc 
accctgattt 
taactgattg 
cgcaaaccgt 
gaactgatcc 
aaaatcgtcg 
cctgctgcgc 
gtttcgcgct 
atcgccagca 
ctgatgaccg 
atgcaggatg 
gccaaggctc 
ggcgagcatg 
gctcgcgccg 
cgagtcgccc 
ctgatcgaag 
gatatgcgca 
caactgcgca 
ggttgcgagc 
ccagcctttc 
ctgcaactga 
gggttgcgcc 
cagaccgcac 
cagctctacg 



gtctttttgc 
cggattatgt 
gctattgacg 
cagattgcgc 
ctgtagctgg 
gcgcccggcg 
tgcaccgctg 
cccggccacc 
ggtgggtgcg 
gctgtgcgca 
tggttgcgac 
tgccaacccc 
atccactgca 
cgcgctgcgc 
ggccaagtcc 
atctttgttc 
cttcagcgcc 
acgcgccatc 
ttcgggcggc 
ttcagaattg 
atctgtgttc 
cggtgcagcc 
ggcattggca 
ctgctcggct 
ggcttgccca 
cagagccagt 
catggtgagt 
gttccagaaa 
tgaacccgcg 
gctgtttcct 
gacggcaggc 
tgactgacat 
tacggctgac 
agccaggtct 
aacacgcact 
gccctgagca 
tgacggggct 
agtggtgtac 
atttataaag 
tttttgccac 
gggaccgtga 
ctccgcctat 
agccacttcc 
tcgcggccgc 
acaatgcctt 
atcaggacac 
actcttcctc 
atgtggtgtt 
tggtcctgac 
ggatcaacag 
tgcgcaacct 
cattgctcgg 
ccgcgatcgc 
cactgatgca 
acctgctggg 
tcaagcactt 
ctcaacacat 
ccatgctgct 
tgagcaacct 
gggggctcgt 



agttgcaact 
ttgttgacgg 
tttttcgtct 
tcctgctcgg 
gaaatccggt 
ataacgcttt 
taagacttgg 
gcattgagcg 
cttttcttga 
gccttcttgc 
caggtgctgc 
ttcaggtctt 
tgatgtgggt 
accatttcca 
ttgtcggcac 
gtgatcttgc 
gggttggcgt 
catgccgctg 
agatgggcaa 
tcgagaaagg 
gtgggagctg 
tcggcaggag 
gctgccccgc 
cctgtcggcg 
ccgccaccca 
tcgccaggag 
tgccatccga 
aaatgatcga 
cgccggacaa 
gataattgcc 
aatcaatggc 
caccgtgccc 
gatacatttt 
gcggtttgct 
ggagaatgtc 
tctgaatctg 
gggttgagtc 
aagccctata 
atattttaaa 
aacagctttt 
cccactcagc 
catgcgaata 
caatgctgat 
gctcatccag 
gcagtcgcga 
cgggctggat 
agatctgaaa 
gcaagccgcc 
acagcagctc 
cgccaaggca 
ctatagcgtg 
cgagcagcaa 
cgacgacctg 
cggcctcaac 
ccgtatgcgc 
gctgaccctg 
tggtggcaag 
gcaactgccg 
gctcatcctg 
ctgatgaatc 



gggcggacaa 
ccttgtccgc 
gctcgacgac 
gaatgctttt 
cgttcaaacc 
gctgggtctg 
gtttgtcgaa 
tcgcagaatc 
tccactcact 
gccccgacat 
aggactgttt 
ctgcgtcgac 
cggcagcaat 
gtgactgcgg 
tgtctgcggc 
catctgcgtt 
tgatgaaatc 
caatcgggcg 
ccatcggttc 
ctgcgtctgc 
cgacctgttc 
aatctgcgca 
cactgccctg 
cgccttgcgt 
caggtgtcgg 
tgggttggtt 
gagtgagcga 
gatcgccatt 
gcgtgagcga 
gtccatctcc 
ctgaatgact 
ttccagctcg 
gcggaagtga 
ggcatgttga 
atccacatca 
atcggccgtc 
gaggatcaca 
gcgataacca 
atgtgaaatt 
ggcacaaaaa 
ggacataatc 
cttccggtcg 
ctgcacaatt 
cacagacgca 
gcggtgaagc 
aacgcggcca 
caggttctgg 
aggaagcagg 
aaacagctgc 
ttcgcgcggc 
gcggtttccg 
gaagccgggc 
tccgcgatga 
acggcacgcc 
aacaagaacc 
acggccaatg 
cagctcaagc 
atcctgctct 
atggctgaac 
agatcatcaa 



http://www.ncbi.nlm. nih.gov/entrez/query. fcgi?cmd=Retrieve&db=nucleotide&list_uids=1332„. 2/4/03 



12481 
12541 
12601 
12661 
12721 
12781 
12841 
12901 
12961 
13021 
13081 
13141 
13201 
13261 
13321 
13381 
13441 
13501 
13561 
13621 
13681 
13741 
13801 
13861 
13921 
13981 
14041 
14101 
14161 
14221 
14281 
14341 
14401 
14461 
14521 
14581 
14641 
14701 
14761 
14821 
14881 
14941 
15001 
15061 
15121 
15181 
15241 
15301 
15361 
15421 
15481 
15541 
15601 
15661 
15721 
15781 
15841 
15901 
15961 
16021 



ctttctgaac 
cgtgattgcc 
gatcgcggtg 
gccgctggcg 
gttgtcgatc 
ggccttcggc 
cctgacggtg 
cgcccgattc 
ggccaacctg 
ccaactgttc 
cctgatcatc 
catgaccgcc 
cgcgcagatt 
caacaccgaa 
ccagcccaag 
tatgccaacc 
actgcaaagg 
tgaatggcaa 
tcaccctggc 
caaccggctg 
ccatctgcag 
cttcacccag 
cgccatctcc 
aggcggcgag 
gctgcaatcc 
gctggaatcg 
gttctcggcg 
gattgccgaa 
ctatgtgcga 
gcaggtgtgg 
acagacagaa 
gcatatcgcc 
gcgcagcccg 
tgcggaaatc 
cctcgaaccg 
tggtcagatg 
gatcggcgag 
tggcgtggaa 
tgaagagggc 
caacaccgcg 
gccgtcggtt 
cgaagcgccg 
aatcgccggc 
catcgtgctc 
gccgaacgct 
caagcgcgtt 
cgaccgcatg 
gcatctcaag 
ttatgactca 
tgttgtcgtg 
catgtatgtg 
gcagttcgat 
aaagacaccc 
agcgcggtgc 
tgcgaagtga 
gaatgcacgc 
cgcccgctcg 
gacggtttcg 
cgccgcgaca 
acccggtctt 



atggtcgcct 
atcgtattca 
aacatctgta 
ttttcaacct 
tcgaccacgc 
cagttcgtcg 
gtcaactttc 
acgctggacg 
atttcggtct 
ggtgcgatgg 
gttgccatca 
ggcgacgcct 
ccggcgctgc 
gcaggcgttg 
gcctggatca 
ggcgtgttca 
gccaaaccca 
gggaagactt 
cagaacagcg 
gtggtgcagt 
ccagacgaat 
acacacgtgg 
ggcaccacgg 
ctggccaccg 
tgtggcccgc 
gagcagcctg 
gttttgcagc 
acactgatcg 
atcgccctca 
ttgctgacac 
acgttttttg 
tttccgttgc 
ctgcgcacct 
agcaacgcag 
atggacaacg 
tttgaattac 
cagtggtcga 
agcctgcatt 
gcggtgtgcg 
ttcatgctcg 
ccggccgtta 
ctggaaaagg 
ttgctggtag 
gaccaaggcc 
gcgcccaaaa 
cgcctggcca 
ctgactgacg 
gaagagtcgc 
cccgtgacgg 
cagatcatga 
ggcgatgaaa 
ggcgaccgac 
acgcagcgcg 
gcgggatttt 
gcaaagccga 
tgctcagcgc 
gcatcgccca 
gacggccgct 
cgctgccggt 
tgcctaccgg 



tgtcggccat 
tgatgatcac 
tttcctgcct 
tccctgcggt 
ggctgattct 
tgggcggcaa 
tggtgatcac 
ccatgcccgg 
acgaagcacg 
acggggcaat 
acatgatcgg 
tgcagctcta 
tgatctctgt 
aagccaacat 
ttgccgcagt 
tcaccattgc 
aggcagaaga 
tgcgcacgtt 
cgctggtcga 
tcggcctgac 
ttcgtttcac 
cagtggatgt 
atcgccagga 
tcagttccat 
agttcatcgg 
agcttgctca 
gactggcctc 
agcattgcca 
agtcccagat 
cggaaagcga 
cgctgagcaa 
gcgcgcctga 
tgttgcgaga 
ccaaggtcag 
agcatgccgc 
gcgtactgaa 
tcggctctgc 
gccgcctgca 
atgaagaggg 
gttcagtgtg 
tccccaagca 
tcaagtcgcg 
gcgtgatcgg 
cggcgcacat 
catcgcaacc 
gcatggaagc 
tcagcgtggc 
tggtggtcta 
tgcttgataa 
ccgggccgca 
aagacggcct 
acattgaggt 
cttgagccag 
gctggagtgc 
cggcagtttt 
cctcggtgcg 
tcgaataggc 
gctcggtgat 
gattgccgac 
catacgggcc 



gcgtcgctca 
gccactgccc 
gctgatcatg 
gctgctgctg 
gctcaaccag 
cctggcagta 
caagggctcg 
caaacagatg 
caaccgccgc 
gaagttcgtc 
tggtatttcc 
taccgtactg 
tacctgcggc 
cggtcgcgaa 
ggcgatgctc 
gatcatttgc 
acaaggcgcc 
ctcccccagc 
tgcgctggtc 
gctgccctcc 
ggtctacgac 
ccgccagttc 
agatcagtgg 
gaccctcatc 
cttgcaggaa 
ggaaatgcag 
cgagtgcgtg 
gcacgagcgc 
ttaccaccag 
aggcttgctg 
cgacatcagc 
acaggcggtg 
agagttctat 
ggtcatgggt 
ttgagcctgt 
cggccagcac 
tcaacagcag 
acgcctggac 
ccattcacag 
gttgtgcgtc 
gccgcaggct 
ctcgcaattt 
cagcgcctgg 
cgcctcagcc 
tgcagcgaaa 
tgtgcggcat 
cgaaacgcca 
tcagcgcatg 
cgtcgccagc 
tgcgcatctg 
gcgcctgacc 
gacctggtga 
tactccgccg 
aagataccgg 
ctgctggctg 
ccggacggca 
gtggacgaca 
tgcctggggg 
gccttgccgc 
atcgacagcg 



gaggtggtcg 
accgggctgg 
ctggctatgc 
accaccatgt 
gacgccggac 
ggcctggtga 
gaacgggttg 
tccatcgaca 
tcggaactga 
aacggcgacg 
atcggcgtgg 
accatcggtg 
atgatcatca 
atcgccgagc 
ggcttcgcgg 
ggtgccggcg 
gttgcggtag 
cgtcagtttg 
agtgagatcc 
ttcatcatcg 
gtcccgatgc 
aacggtgagg 
gtctggctac 
accgaacgca 
accaaagcca 
cgagtgctga 
ccattgcggg 
gacaccaacg 
tactgcggtg 
cgtgacggtc 
cagatgctcg 
ctgctggttg 
cacgttcccg 
cgcttcgacc 
caggtttgag 
cagggcgcag 
gacctggcgc 
gacagctgga 
gccagcgtgg 
tcgccagccg 
gaggccgaac 
ctcaaccgca 
agcctgacac 
atgacaaccg 
cccgccggcc 
cagctcacta 
gagggattga 
ctgcagcgtt 
gcgcgcaata 
gtgaccgccg 
aaaatcgacg 
acgccgcact 
tgcgcgtcag 
cagccaaggt 
aaatcgttgg 
ttcaggtcgg 
cgttgctggg 
cctttgccgg 
caacccagcg 
ccattctgct 



gcgcgttctt 
tcgacgttct 
acctgccaag 
tccgtctggc 
acatcgtcga 
tcttcctgat 
cagaagtggg 
gtgacttgcg 
acaaggaaag 
ccattgccag 
tgcagcacgg 
acggcctgat 
cccgcgtacc 
agatcaccag 
cactgccggg 
gcatgctgca 
cgcccggaga 
tcctgcagtt 
gtcaacgacg 
agtacgtgga 
tcaaggcgac 
acgtgcccgc 
cggccgagca 
tggagcgcgc 
ttctcggctg 
ccctcacgcg 
cgatccgggt 
tgctgaccga 
ccgatggctt 
tgcgccagac 
tgcaacagct 
cacaggatct 
tactgtcttt 
ttgaagatga 
catacggaga 
cactgcccct 
tggacgattc 
ccctcaacgc 
acctgacgct 
gcgatgagtg 
ctgcacgtca 
ccaccgggat 
gcccggcagc 
cgcccggcat 
cggcgactga 
ccatgctcag 
tgctcaatgg 
tcaaggaccg 
ccctgccgtt 
acggtcgccg 
atcagcgttt 
gaaccagtgg 
tggccgggtg 
cggcgatctg 
cttcacccag 
tgcgccgatc 
ctgcgtgctg 
gccggatgat 
gccgcgcatc 
cggtgaaggc 



http://www.ncbi.nlm. nih.gov/entrez/query .fcgi?cmd=Retrieve&db=nucleotide&list_uids=1332... 2/4/03 



16081 
16141 
16201 
16261 
16321 
16381 
16441 
16501 
16561 
16621 
16681 
16741 
16801 
16861 
16921 
16981 
17041 
17101 
17161 
17221 
17281 
17341 
17401 
17461 
17521 
17581 
17641 
17701 
17761 
17821 
17881 
17941 
18001 
18061 
18121 
18181 
18241 
18301 
18361 
18421 
18481 
18541 
18601 
18661 
18721 
18781 
18841 
18901 
18961 
19021 
19081 
19141 
19201 
19261 
19321 
19381 
19441 
19501 
19561 
19621 



cagcgggtcg 
gcccgcaaca 
ctgcgcgagt 
tgtgccacgt 
atcgccgaag 
cgatttgccc 
ggtctgccac 
gaaaacggtt 
cccgtggccg 
gccgagcgcg 
aacgtcaccg 
tacaagcaag 
accgattgcg 
gagcctgtgc 
gagtaatgat 
tcagcctgct 
aggcgcaggt 
accaggaacg 
aaaccatcag 
gcctggcctt 
cgttgcttga 
cctgcatgga 
ccccggccaa 
ctcttagcga 
atgtacagac 
acggcatgtt 
gctttggcgg 
tcgacgagct 
tgatgcccag 
tgcagttggg 
gccgcgactc 
tttcagcatg 
tgagctgggc 
cttgctgccg 
cgtcggggcg 
cctgacgctg 
cccgaccatt 
tactgtcggc 
cgcaacgccc 
acctctcgac 
gcttgaacaa 
cgacagcgcc 
cagtcaggac 
ctgaagatct 
agcaggagca 
atgcagaggc 
ctggcctcga 
tcgccgagtt 
gttatgccac 
atggccggct 
gaacccgctc 
cgtctgcacc 
cgtgcagcag 
cgtcatggcc 
gggcaatacc 
catgacccgt 
gtggcccaag 
gttcgtcctg 
catcgttatc 
gcccatgacg 



ggctgttcgc 
tggactgcga 
ttctcgacca 
ccgatcgatc 
cctttcgcgc 
gtgcgcaacg 
cctcggtgta 
cgatcaccgc 
atgaagtacg 
gccactaccc 
gtcgcgatca 
tggaaatgct 
cggtgcaact 
ccctgcagga 
ggacgaaaca 
gacgccgctg 
ggaactgaaa 
tgacaaccac 
cctgaacgat 
tatccgccag 
acacaaacgc 
ggagacgctc 
agcgccacca 
aaaaccattc 
cagccgttcg 
cttttcgcaa 
gggcggtata 
ggcgcagcgc 
cctgggcagc 
ctttgcccgc 
gctgtcacaa 
agcgccttgt 
gccggtcaat 
ctgatcgcct 
ctgtgcctga 
ggcggcgaac 
gccgatctgc 
tgccggattc 
gagacgctgc 
gagcgctggt 
ctcgcttcgc 
ggacagggct 
cagcatcttt 
gtatcaggat 
cgagcacgag 
cgaaccggat 
ctctctggct 
gcggcggctg 
cttgtgccac 
tggcttgcag 
c.tgctggccc 
gcgtttctca 
gtgccaccaa 
ccggtggccc 
gaaaaactgc 
aataccgacc 
gaaatggcag 
tcggagctgc 
gacctgattg 
ctgtcgctgc 



cggtgccggt 
cgtcatcgtc 
tgaactggac 
cagtatggaa 
gcggggccag 
ggaaatcggc 
caccctgctg 
gctctacacc 
ctcgttgctc 
cgctatcgac 
ccagcgcgcc 
cttgcgcctg 
gaacgacgcg 
aaccctcgac 
ctcgaggaag 
cgccagcatc 
tcgatgctcg 
aagcgacgtc 
gtcgaccgat 
gacgtgcagc 
ctgcaagcca 
aacgaagaag 
gcaccccagc 
ggtgaaaaac 
gtactgagca 
ctgctggtgc 
gcctttccgg 
ctgccggatc 
gttcgggtca 
cgcgatgtac 
gcgcttgggc 
gcctgcgcaa 
gcttgagctt 
ctgacgagac 
gcgatgccga 
agcaagcctg 
tggccccggt 
aggtccggct 
tcacgctgct 
cgattagcac 
tgcgtcctgg 
ttctgaccct 
tattgcggct 
gatgtcgaaa 
cacgaacaac 
gacgacgagc 
ctggacctga 
gacgccggga 
ggcgagcgcg 
attacccggc 
tgtttctcgg 
agattgccat 
acatggcgct 
atgaaatgca 
aagccagcgc 
cggatgtcgt 
accaggcgag 
aagccggctt 
tctccaacct 
cactcaagtt 



tgcggcaaga 
ttcgggctga 
gaaaccctgc 
cgcgcccgcg 
aaggtcttgc 
attgcatcgg 
ccacggctgg 
gtactgatcg 
gacggccata 
gtgtccgcca 
aacaatcgtc 
ggtgaatacc 
atcaatgcgt 
gggttgctgc 
acccgcaacg 
gtcaggcaag 
accacttatc 
gcgagagcct 
ggcatgagaa 
aacagcaatt 
aagcgtctca 
gttaagtgac 
gccctgcgcc 
gccctgcatc 
agtcgctgca 
cgcccgtaag 
tacgcgcaga 
agcctgacgg 
atgccaacaa 
tcaagcgtct 
aggacgttga 
ggtcgacccg 
cagcgccaag 
acctgccaag 
agcactgctg 
gtactggcag 
tgagccgctg 
ggccgatcaa 
gcgttccgcc 
gccgctgatc 
cgatgtgttg 
ggcgggacgg 
cagccacgag 
gtctcgaaga 
agtgggccga 
aagaggagca 
cgctgcgttg 
ccatcctcga 
tggtggctga 
tggcggccca 
ggccatgtcg 
gaccctgttg 
gtacggcatc 
gcagcgtgtg 
cagaaccgtc 
cgctcacctg 
caagaacgac 
cgagatcggt 
gctgctggcg 
gctgctgttt 



ccacgctgat 
tcggtgagcg 
gcgctcgctc 
ccgctttcac 
tgctgctcga. 
gcgaaccgct 
tggaacgtgc 
agcaggactc 
tcgtgctgtc 
gcatcagccg 
tgcgccagtt 
aggccggcgc 
ttctgcgcca 
gcatcacctc 
ggaggctctg 
tgccgagcga 
agagacccgg 
gtcgcaggac 
agaaaaaaac 
gcgggtagcc 
gcgcgcggtc 
gatgaccgca 
tgtcgagcgg 
cggtggcctg 
agacaccccg 
tgccgaacct 
acaagtgccg 
ccccctcagt 
aaccgaaaac 
gagtgcccat 
gctggacatg 
ctgctggcta 
ggccaacagc 
ggcgtctggt 
agcctgctgg 
tttttcaatc 
ccccaggcgt 
caggtccatg 
gcctggcaag 
cttggcgagc 
ctgcccgcca 
caatgggcag 
gagcacagcc 
ctatgacgat 
gccggacgac 
agaagagcaa 
cggtgaattg 
agtgggcggc 
aggcgaactg 
gccatgatca 
ctgatccctt 
atcacccgca 
gctctggcgg 
catgaccacc 
atcgaaccgc 
ctcgacaaca 
ttgctgctgg 
tttctgatct 
ctgggcatgc 
gtcatggtgt 



ggccgaactg 
gggccgcgaa 
ggtactggtt 
ggcaaccgcc 
ctccctgacc 
gggccgcggc 
cgggatgagc 
gatgaacgac 
gcgcaagctg 
gatcctcagc 
gctggcggcc 
cgacccggtg 
ggacctgcgt 
gcaactgccg 
gaacaggtga 
gcacatcgcc 
gcgtcactgg 
catctgcaaa 
atgctcgacc 
gaacagcaga 
gagaaactcg 
ccgatcaaaa 
ccagccacag 
ggccgtggcg 
atgagtgccg 
ggtcagcaag 
acccaactga 
ttcaccctgc 
cgctggagcg 
acgggcgcat 
catgaagacc 
aggtctcgcg 
aggagctgac 
tgaacaccgc 
gcgatatccc 
aacgcctgag 
gtgcagagct 
cccacctgca 
cccggttgaa 
tatcgctgac 
actgcaagtt 
cccagaccga 
accatgagta 
gaaacggctg 
gagtccgaat 
caagcaccct 
cgcttgacgc 
gttgcgccag 
gttgatgtcg 
tggaaggagc 
ttttgctgat 
atgccatcgg 
cgaccatgtt 
cactggagct 
tgcaacggtt 
cccagcgcat 
cgatcccggc 
acatcccgtt 
agatggtctc 
cgggctggtc 



http://www.ncbi. nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db-nucleotide&list_uids= 2/4/03 



19681 
19741 
19801 
19861 
19921 
19981 
20041 
20101 
20161 
20221 
20281 
20341 
20401 
20461 
20521 
20581 
20641 
20701 
20761 
20821 
20881 
20941 
21001 
21061 
21121 
21181 
21241 
21301 
21361 
21421 
21481 
21541 
21601 
21661 
21721 
21781 
21841 
21901 
21961 
22021 
22081 
22141 
22201 
22261 
22321 
22381 
22441 
22501 
22561 
22621 
22681 
22741 
22801 
22861 
22921 
22981 
23041 
23101 
23161 
23221 



gcgtctgctc 
tgttcaagca 
tgctggtggg 
tgccgttcgg 
tcggagtcga 
tcgtccacta 
tggcaatggc 
tgaaaggccc 
tcagccgcgc 
aggaagtcgt 
cctcggtggg 
cgtcgctggg 
tggtgctcat 
tctggccgcc 
gtcagttgaa 
tgctgttgat 
cgattctggc 
cgacgctgct 
tgggctttgt 
ccgcgacgcg 
tcttctggct 
ggctttgctg 
gatcgccagc 
gatgctcatg 
gttgaagctc 
gcagaacatc 
gtatacccag 
cacttatctg 
gctgctggtc 
gcgcatgagc 
caagggccat 
gaaaccggtg 
ttactaccgg 
tgccctggcg 
gctggcgcgc 
ggccgtcggc 
tcgtatcgac 
gaatacgcgg 
cttgaggaac 
atgctgcgca 
cgataatcca 
ggctctgtga 
tgacgacaac 
gtcttgagag 
gttgcaaggc 
catgaacacg 
cagcatcaac 
gaccgcgtgc 
acgctctgac 
gcggccttct 
gacgccattg 
cgacttttcc 
tcatggtccc 
cggttgttgg 
gtcagcgtag 
cccggagcag 
cgatgtacac 
cgatgaccca 
tacgcgcaac 
catcaagcgt 



gacagcctct 
aggcatgttt 
ggtgatcacc 
tatcaaactg 
actgatccag 
gatgcctctg 
gcgtctggtg 
actgcgctat 
actgacctca 
gctgggcacg 
cgctctgctc 
gcctgacgct 
ttcaggcggt 
cacttcgtgg 
ccagaccctg 
cgaggccgcg 
catgccggcc 
ggaattgggc 
ggtgcaggtg 
cgggaaaagg 
gtcagtgagg 
gcgctgacct 
gaagggttgt 
cgcttgatca 
gatatcaaaa 
ctcaatctgt 
gtcaagccgg 
catgcgctga 
atcgccctga 
cacgaggaca 
cgccgccaac 
gaagaagccg 
cctggcgaga 
ctgatcgcac 
acgctgtaca 
cacatctaca 
gacgacatgt 
tgctgtgcag 
gataatccca 
gatctgtgac 
aactatcgtg 
cgcagagcgt 
tatcgtgcga 
gtgcgccaag 
gggaggtgtt 
ttcggcttac 
tcatgccggt 
cacaccacaa 
tccataaaac 
atgtgcaagg 
gccactggcc 
gttacctcaa 
ggtcgtccag 
agtccgggcg 
caaacagcaa 
tgacggcctg 
ctcactttct 
gcgaccgccg 
caccacagca 
cagaccttcg 



tctattcgta 
ctggtggtga 
tcgctgttgc 
gcggcagtgg 
ttcatcaaca 
gacgcgcaga 
ccgtgcatgc 
gcggtagtgg 
ctcaatgacg 
ctgctgggga 
gacagccagc 
acgccgctgg 
ttgtctctga 
ttgccaggca 
caacacatga 
ctggcgatca 
aaaagcatgg 
accggcgaat 
ccgtgagcga 
gccaggtcgg 
tcacgctggg 
tcaagggtat 
ccgtggtact 
gcagttgggt 
agatcgaccc 
tgctgagcat 
cattgggcac 
tcgagctgtt 
tcgactttgc 
tcaagaagga 
tggcccatga 
acatgctgct 
cgccattgcc 
aggcaaagaa 
aggtcaacgt 
aagtggtccg 
gaacaacctt 
atctgtgacg 
actatcgtgc 
gcagagcgtc 
cgacgctctg 
cacgcaaggc 
cgctccgcgt 
cgcaaagcac 
catatcaagg 
gcggtcagtg 
ggatgtcgtc 
agtcgcgtcg 
gtcgttgcag 
cgatccccca 
atactgctgg 
tcatttctaa 
accttcgcgc 
tttgcaagcg 
gagcaagagc 
ctttctttgg 
tcacccggct 
catcggcttt 
acaccccagt 
cccggatcac 



tatgtgaggc 
tcctgaccgc 
aggcactcat 
gcatgacgct 
tggctttcga 
atttcttcga 
tgctggtgcc 
cggtggtagc 
actggtttgc 
tgctgctgta 
gcggtgcact 
gcgagctgtt 
ttacccaggt 
tgaccgcgga 
tgctgtatgc 
tcgggctgta 
ccggtatcgc 
tgtcgaagct 
aaaaacagaa 
acaaagccag 
cctggccaat 
tgaacggcct 
gagctttacc 
gcagatcggc 
cttttcccac 
cctcaaggcc 
tctgatcctg 
tcagcatgtg 
catgcagaag 
atacaaacag 
aattctcaat 
ggtcaacccg 
gatgattcac 
agcgggcatt 
cggaaaatac 
ccagctggaa 
ccactcttgt 
cggagcgtca 
gacgctctat 
acgcaaggca 
cgtccggtct 
attccactgg 
cggcatgccg 
ggtaagtatc 
cagtgttcat 
catttcctcg 
caggctggcg 
tacgtccaga 
tgcatcacgc 
atcatggtgt 
gctctgggcg 
tcctttaact 
atgtcgggcg 
ttccccgcac 
gttaggctac 
ccattttaga 
gcagccatgc 
cgtcgatacg 
cttttttgac 
acagacttcg 



ggtcatggaa 
tccgccactg 
gcagattcag 
ggccatgacc 
tcttatcgcc 
tctcatcctc 
ggccttttgt 
gatgatccct 
catcggcggc 
tgcgcccttc 
cagcggcggc 
tcaggaaacc 
catctgggac 
aggcctggac 
cgcgcccttc 
cgcgcaacag 
ctttctgctg 
ggctgatctc 
aaagccacgc 
gacctcggca 
gaaagcgtga 
ttcatgtcgg 
ttgtgcagcg 
tttctcttcg 
gccaaacaga 
gtggcgatcg 
ttggccaaca 
ctcagggtga 
tatttccatg 
tccgagggcg 
caggagccaa 
actcactatg 
tgcaagggcg 
ccggtcgtgc 
atcccccgcc 
gagatcactg 
tcgctcccgc 
cgcaaggcat 
gtccggtctt 
ttccactgga 
tgagtatcgc 
agcgtgagga 
ttctggacgc 
agggaggggt 
gaacccgtct 
ctcaaatggt 
taggaacccg 
aacaggaagt 
agttgatcgg 
tcgcgccgac 
gcaacctgag 
gcacgacagt 
gccaccaaat 
agccgtgggt 
gaatcatcat 
tatctgcggc 
atgaggccag 
taccggcttg 
gaaccactgc 
tgtttcaaag 



gcgttggcgt 
ggcgttgcgg 
gatcagaccc 
ggtcgctgga 
cgctctggag 
ggcatgggcc 
ttcaaatacc 
gcgcccggca 
ctgctgctca 
tggatgttcg 
cagataaacc 
ctggtcatgc 
agttacatgg 
gtgtttctcg 
atcgcgttgc 
ctgaacgtct 
gtctacctgc 
aagtcgatac 
ccaaacagat 
agctgctggt 
atcgcctgca 
cagtcgagct 
tgggtctggc 
cgcccaaggc 
tgttttccgg 
gagcgacctt 
gcgatctggc 
ttctggggtt 
ccaagaagct 
acccgcatgt 
gcgctgcgcc 
cggtggcgct 
aggacgaaga 
aaagcatctg 
ctaccctgct 
acgaagtgat 
gtccgacctt 
ttcactgggg 
gagtatcgcg 
gcttgaggaa 
gatggtaccg 
acgataatcc 
tctgcgtcct 
gtataggagg 
tgcctgggct 
ccagccctgc 
gtttttcgtt 
agtgattgaa 
gacgcaacgc 
tgacaaacgc 
cgtaaaatgc 
gtaatcccgc 
gaccagctcg 
ggcacaccct 
ggtttcgctc 
tggcgcacag 
gccgcaacgc 
tccgtgttgt 
gagcgctgcc 
ggcagggtct 
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23281 
23341 
23401 
23461 
23521 
23581 
23641 
23701 
23761 
23821 
23881 
23941 
24001 
24061 
24121 
24181 
24241 
24301 
24361 
24421 
24481 
24541 
24601 
24661 
24721 
24781 
24841 
24901 
24961 
25021 
25081 
25141 
25201 
25261 
25321 
25381 
25441 
25501 
25561 
25621 
25681 
25741 
25801 
25861 
25921 
25981 
26041 
26101 
26161 
26221 
26281 
26341 
26401 
26461 
26521 
26581 
26641 
26701 
26761 
26821 



ggccagcgcg 
tgtcgccccg 
catcgacgtc 
tcagtcgagg 
ggaacagcag 
cattggcttc 
cctgggtgct 
cgatctggcc 
gcgaacgcgg 
gctcggaagt 
ccagggtcag 
gcaattcggc 
caaacaggtt 
gttcagccag 
gcggaacgtc 
aaatcaatac 
atgacccttg 
agtgctcccg 
ctgtgctgtt 
cgccacgcga 
actgctggtc 
cgacaacatc 
gcacaaactc 
agccccagcc 
ccaccgcatc 
acagcgtgtc 
actcttcagg 
cgagcgacat 
gggcatcata 
ggatcacccc 
ggttgccggt 
gtttgcatca 
aggtcaccga 
ggatcgatgg 
aaggcttcga 
cgcaccccgt 
ggcatcgctg 
atcgatgact 
ggcgacatcg 
ctcggaggct 
gagatggcgt 
aaccttgagt 
gtgtgtatca 
tatcgaacgc 
gcttgagttg 
ctatttgggg 
ttctggccag 
attcactcaa 
cctgaagctc 
gaaggtgctc 
ctgccaggat 
cgcgcagcag 
tgatgcagag 
taacgcctga 
gcactccaga 
cgcgtcctcg 
gcaccattga 
gcttagcgcc 
gggcgctggc 
ggccatgagc 



caggccttcc 
cgttggcagt 
atgggggttg 
ggtcagaatg 
cttgccgata 
cagaccgtgg 
gacattgcct 
atcctcgatg 
aatcacttga 
gggcgtcagg 
gatcgacggg 
agaaaacttg 
ggcacccacc 
ttcattgggg 
cagctccttg 
ggagttgtta 
gcagtttttt 
tatcgatacc 
ggcaacactg 
acggctttcc 
acggtagcga 
ttgcttgtcg 
cacgtatttg 
caaacgcttg 
cggcgagact 
gttatagacg 
gttctgagca 
accgaactcc 
ggcgtaggcg 
gatcaacaat 
gattgaggat 
ggctgagcat 
ggctgcaggg 
ccagcgcgcc 
gccgttgcgg 
cgccgacgac 
gacgctgggc 
gctttattga 
acctgctgct 
tccatcgctg 
tgcaagctgt 
ggagcaaacc 
ggcagcaggc 
gccattggca 
ccagtgctcg 
gtgaacactg 
cgcccgcgca 
cagttcttct 
gccgagaaac 
ggctttctct 
gtcgcgcgcc 
cgtagcggcc 
aagctggttc 
agtttgtttt 
cacagtcgcg 
tgttcaaact 
ccgtcaggtc 
tgctcacgct 
ggctgagccg 
gcatcgcagt 



ggggcggggc 
cggcctccgt 
ccgttctgta 
aacagccgct 
tagggaatgt 
aagccgccga 
cggcgcacac 
tccacgatca 
aggctggtgc 
tattcggtgc 
ttggcgatga 
ctggcgttct 
tccgacgctg 
tcgatgtcga 
accagtttct 
cgcacatcag 
tgctgttgaa 
attggagccc 
ctgccctgcc 
agcagctctt 
atagtccgat 
gccttctcgt 
gccggaccac 
tcaacaagac 
tcgatgcgcc 
aaccactgga 
cgaatacgtc 
ctggcaaagt 
gtgtgtttcc 
aaaggcaacc 
cgaacgcccg 
gcccgcgcgc 
gtttgccatc 
gtcgcaggca 
gtcaccggcg 
cgccagccga 
agaccactgg 
taccgtgctt 
gggacatcgt 
cctggtccat 
tgaaactgat 
cgccgagcgg 
tcgacaccca 
tcgctcagac 
gaaaaacggc 
agcgtcgcga 
cgttcggcca 
acggcggtca 
gcgttggcgt 
tgctcaaggg 
agcaggctgt 
gtcagagcaa 
ggattcaggc 
cgggtgcctt 
accagtattg 
ccagccagac 
cgtcgctttc 
gcgagggcgt 
ggtcaatgcc 
cactgaccga 



cgtcgatcat 
cacgtcgttc 
catagcgtgc 
cgcgctgact 
cgcccaacag 
tgaccagcga 
tgggttgggt 
tttggacctg 
ccgccgtgat 
gactgaggtc 
ctgacgcaga 
gcaagaacaa 
ccgggcattg 
gaatgaccgc 
ggtacatggc 
cgcttacgcg 
gttcaatacg 
aggttgtaag 
ccgccaacaa 
gaagaatact 
cagccgcgtt 
cgggcttttc 
gaaccagaac 
cgacatcggt 
ccgaggtgtg 
agtggtattc 
catcgaggtt 
cagccagggc 
aggcttctgg 
acattaaggc 
gacaaagtgg 
tgattggcca 
cagctgacca 
cacgccaggc 
tcgtacgggt 
gcattggcgt 
ccaacgatct 
ggcacgcagg 
gaatgcctgc 
gttggtgtga 
catgtcctgg 
ttccatcatg 
gcagcccctt 
gcaagctgtc 
tgtctgccag 
ccgcttcatt 
gcgtggtgtc 
ttgcctgctc 
tttcccagaa 
ccagtatctg 
cggcgatgtc 
tactgcgttt 
agccgtgacg 
gccgggggtg 
cggcccaagc 
accggggcgc 
gttacgggag 
cagcgccaac 
cagactctgc 
taacccacga 



ttgggtaaag 
cttgattttc 
tggattgacc 
cagttcgcga 
cgggatcttg 
gccgtgctcg 
gtcattgatc 
aggcttgcca 
gggcagaatg 
gatcactgca 
accattgcct 
cgttgaactg 
aaattccagc 
atcgatttcg 
cttgcgctct 
gatattgcct 
cggtgcaatg 
gcaggccggg 
gttcacgctg 
ggcgacaccg 
ggcgtatttg 
gactttcttg 
cacgccttcg 
cagcgccgtt 
ctcgctggaa 
ctgactcagc 
tccctggaca 
agtagacaac 
ggtgaccgcc 
cttgcgcatt 
gcgtcgtgtt 
ggctttccag 
gcactacgca 
ttgcgccgcc 
cgagcagttc 
catcgatcca 
cggtgaattc 
cattcattga 
aggtcttcga 
gcacggctca 
tgctccagca 
cgatcaagtg 
gcgcaggtct 
cgaggcgatc 
ccactcagcc 
gagctggctg 
gtctaacaag 
ctgcaacgcc 
ctgcgccagc 
cgtggcctgc 
ttcgcggcgc 
ggcgagcatg 
cgccacatga 
tcgggcactt 
caggcgccca 
agcgctttgg 
aagcagatgc 
cagcgcagca 
agaaacacgc 
gcgttggcga 



actttctgga 
ttcatctggt 
tgatcgccga 
ctgcgggact 
tgaatcctgt 
gcaatcaccg 
gtcgacacat 
tcgttgtcca 
tcagcggccc 
ggctgattct 
tcaagcgcat 
gtgccgccat 
cgactggaca 
atcaggttgc 
ggcaggtcgt 
tgcctgaggc 
cccctgttgc 
gccgcgacac 
tcaatgcttt 
gccaccacta 
agtggcagca 
ctgtagtcgc 
tcaggcagcg 
tgcaggtcgt 
gggctgacat 
cgctcaagaa 
ggcgacatgt 
tcggtctgcc 
cacgtggcag 
tcacactccc 
acgaatagtg 
acgatcgagc 
gcgggtctgc 
ctcgccaagc 
gatactgcaa 
gcagtccagc 
actgaattcc 
cggcaatacc 
cggtgccact 
ccgaattgtc 
gaagggttca 
agtgcagaga 
gcccaagcga 
gttgcatcgc 
acggggtcgg 
gcggccaggt 
tgccgcaggg 
tcgcgctgca 
gcctgctgct 
tgccgcgcgt 
aagatcggtt 
ggcgtattcc 
tggcctgcca 
cattgggcgg 
gcagaagacg 
tcaaccccca 
actgcgccag 
ccggttccgc 
catgacggct 
atcggtcatg 
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26881 
26941 
27001 
27061 
27121 
27181 
27241 
27301 
27361 
27421 
27481 
27541 
27601 
27661 
27721 
27781 
27841 
27901 
27961 
28021 
28081 
28141 
28201 
28261 
28321 
28381 
28441 
28501 
28561 
28621 
28681 
28741 
28801 
28861 
28921 
28981 
29041 
29101 
29161 
29221 
29281 
29341 
29401 
29461 
29521 
29581 
29641 
29701 
29761 
29821 
29881 
29941 
30001 
30061 
30121 
30181 
30241 
30301 
30361 
30421 



ccactccgaa 
agaaaggctc 
cgtcgcaaca 
aacgcaagac 
tccagcttga 
acaaacacga 
ctggcgacca 
ttgatgaaca 
agcaccacat 
tcctgggaca 
tccttcttga 
agcacgcgca 
tccagacgtt 
tcctgctcgg 
atgcacaaca 
tcaacttgtc 
acgagcactg 
acactttctt 
cggacagcgc 
gcatcgccgc 
gagcattctg 
cactggttct 
tgccttgcaa 
cggcgccggt 
tcagcagttg 
cacctgtcgg 
actgcaaacc 
cattggccgc 
cgggcgaacc 
gctgttgacc 
cattgtcttc 
tgttgtcgtc 
ttgttccgcc 
tcaggacctg 
cgacacccga 
cggcagaggc 
tgatgtcatc 
tgccgagcgg 
cgatcacagc 
gcgacacggg 
taaaacgccc 
taactgatac 
ttgccggcct 
tgcatcgcga 
tcaaccgaca 
gcaaccgtgt 
ttggtgagtt 
accagcgtga 
tttgcaaaaa 
tccagttcca 
agggcttttt 
tgcggctcgg 
tcacggatat 
tcgggaacgg 
actatgcgat 
aaatacagat 
gaggcgatca 
cgggtttcca 
tccaggtaca 
ccggtgtagg 



tgtgcccact 
atcatgcacg 
gactggcgcg 
cgacaggtgc 
agggcccgaa 
tggaaaactt 
tctgttgaat 
ccgcagcaga 
gcaccctggc 
aggcgtagat 
aaatctcccc 
cggcgcggtt 
tacgcgcatc 
acaagccagt 
gcagcagccc 
gagcgcctga 
cgacaacgcc 
catctggcgt 
actggctatc 
attgaatagg 
cccaagctcc 
ccgtcaggcg 
cagcgcattg 
gttgccagca 
acccaggtcc 
ctgcgtggaa 
acggtcgatg 
gggacctgtg 
cagaccgccg 
aatgacgtcg 
cttcagctcg 
catgaactgg 
ttcaccactc 
gctcatcaga 
accagaaccc 
accgaaattg 
gatgctgtta 
tgaggtttca 
cttgagctct 
gaacaatgat 
atcccaaggt 
ctttagcgtt 
ggatggcgtt 
tcagcttcgc 
ggctgtcgcc 
tgacaccctg 
tggaggttaa 
ttgcttggta 
atcagatctg 
ggcttaccga 
catcaactcg 
cgcccagcaa 
tgccgggcca 
gaacaccgag 
cagactggtt 
cgcgacgaaa 
cgcagatatc 
gcaccctcag 
gcgtgccgcc 
caccgctgac 



gccaggggtt 
tgccggcagc 
ccagtcactg 
cacccagagc 
gctcacccat 
tttcgaatcg 
acgtccgcgc 
agccggttga 
cacaatgact 
gtaacgggca 
cagcgtggtg 
catttcgctg 
gatatgctga 
gaacaaatca 
tgcgctcaga 
gcgctcttgc 
cgactcatct 
aatgcttgct 
cggtcggaca 
tcgacatccg 
ggcgatacac 
gctgtcagtc 
atcagctgag 
tcctgaagcg 
tgattggaca 
ttgtcgaccg 
agttgaccga 
ttggcatcga 
ccactggtaa 
agagccgaac 
ttcatccacg 
gcaacttttt 
ggtgtcagca 
tcggattgcc 
gccccgccaa 
tcgccgagct 
gccgacttgc 
tcgagctgcc 
ttgctggaag 
gcagaggttt 
agcggccccc 
cgtcgctgtg 
gagcacgtcc 
gccgttggcg 
ggtgcccaaa 
caatgcgccg 
tcctgcaaat 
ctcactaggt 
caattctttg 
atccaaacaa 
caattgcgat 
ggggaaaccc 
gtcgtggccc 
ctccctcgcg 
acgtagcgga 
ctgcccccgc 
caggttgatc 
caatttggct 
ctgcgccgct 
cacgccgaat 



gcaccaccag 
gttgaacgac 
cgcaccagca 
atcaggttcc 
tgcgtggtct 
acagattgcg 
acactgtcgg 
acaggttcgc 
ccgtcgatct 
cgctcttcaa 
cgcgagcgcc 
gtggcgacag 
tcggcgaggc 
gtctcatcac 
aaattcacgg 
tcacgacctt 
gcacgatgtc 
gtgaaagctt 
ggtgcgacgc 
cctgaacggg 
ttttcaaatt 
aggccacagc 
ctgccacttg 
tcgcttccag 
cgttgcccgt 
gtgtaccaag 
tcagttgacc 
ttgcaggatt 
cgccactggc 
gaaactgagc 
agccgccgtc 
ccagggtcgg 
gatcgtccag 
cggcacccgc 
tgccaccgcc 
tttcgtggat 
catccgcagc 
cactttgggt 
tgctggtgtt 
gcaacgaact 
tctgatgagg 
gcactgatct 
atggtctgct 
tcggactctt 
agaatgtttt 
ccgacaccgc 
gcgaccatga 
ggcagcagcc 
atgcgtcgat 
ttgtcgtggc 
ttgagcccac 
agcacatggc 
agcagcactt 
gcggcggccg 
ggaagcttga 
tcgacggcgt 
gtcgacgtcg 
tgcagggcca 
tcgacataac 
aactcgctct 



tgaatccagt 
cgcgactgcc 
gtgcaccgat 
agaacggcaa 
cttggaactc 
tggacatacc 
gatcaagtgc 
ccggcgcgat 
gcgacagcgt 
gcggcgtcga 
gaggcagacc 
tcacgacaac 
gcgctacgac 
tgcagccgcc 
aaacctctac 
ggtcgtcaac 
tccaggatct 
ctcggtactg 
tgctggcccg 
ttcggagccg 
gctgagttgg 
ctggttagtc 
cgcagcgctc 
cccgcgttga 
cgggttagcc 
accaccaccc 
tacgtcgacg 
acccagggag 
atcaccttgt 
ggtttcctgt 
ccgagtaggg 
catgtcatca 
cacggctttg 
gtcgctgctc 
accgccaccc 
cagcttgtcg 
catggccttg 
cagcgcctga 
ggcgctcaca 
gatgctgtta 
gggcaatcag 
tcttgttggt 
tcttcattgt 
tactggcctt 
tctgaagagt 
caacggcgct 
tttgatgccc 
tgcgatacgg 
agagcgtacg 
gcttgagcga 
aggccaagtg 
gtttggctgc 
tgtgcagcag 
taaaacgtgt 
gtgtcaggac 
cgtccagcga 
aacccagccg 
gcggcatgct 
cgactctgga 
cggcgaggga 



gatcctcggc 
aaacccgatc 
cagcaacacc 
gttcgtgctg 
tgcagcaggc 
gggaatactg 
agcagagtgc 
gcgctcgggc 
ggcttcaagt 
aatcacccct 
cgcagcgtcg 
gccggttttc 
ctcattggaa 
gagcagcagc 
tgcaggttgg 
gccatttgca 
tcggtgttcg 
cccagccgct 
ctgtcagggc 
agcccctgat 
gaaatggtca 
tggttattgg 
gattgcaggt 
cgcaagccgc 
actggcgtgc 
gacgaaaccg 
ctggcattgc 
ctgtcactca 
tgctggccga 
gcatccaggc 
aactgggcct 
ctggaaaagg 
ccgaggccgt 
agaccgccac 
gcgccgatgc 
agcgatgcag 
gcgagcattt 
accagctgat 
tcgctgttga 
agtgcttgca 
aaataattag 
agagtcttct 
ttcctgggcc 
ggcttgtgca 
ggcgttggaa 
gttaccaagg 
cttaagattt 
ttccagcgtc 
ggcgtggcag 
ctcctgaatc 
ctcttcgccc 
agccttgagc 
tgggcaaaca 
gaacagggga 
gttcaggcga 
gcattgggcg 
ttcaagcgct 
atcgatctca 
gcgatcagcg 
ctccggaatg 
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30481 
30541 
30601 
30661 
30721 
30781 
30841 
30901 
30961 
31021 
31081 
31141 
31201 
31261 
31321 
31381 
31441 
31501 
31561 
31621 
31681 
31741 
31801 
31861 
31921 
31981 
32041 
32101 
32161 
32221 
32281 
32341 
32401 
32461 
32521 
32581 
32641 
32701 
32761 
32821 
32881 
32941 
33001 
33061 
33121 
33181 
33241 
33301 
33361 
33421 
33481 
33541 
33601 
33661 
33721 
33781 
33841 
33901 
33961 
34021 



gccgcgcaat 
gcaatcgtgt 
gaaatacttt 
aacctttcat 
gcgtcacaga 
gtgatacagg 
gtgccgcttc 
ccccgtggca 
gaagcgcttg 
cagctgcaca 
ctgggtgaac 
gtgaagcgtc 
cagttcatcc 
ggtcgagccc 
gctcaacggc 
gtagccctcg 
gctctctgcc 
gctggacaac 
caacagcaag 
tgcagttacg 
ctcatggtgg 
tttagttccc 
tcaccatacc 
atcgttactc 
catagggcct 
gcagcaagcg 
gaaacatata 
accgaacaca 
aacaataaca 
cgataccaac 
ttggatactg 
agtcatcatc 
cagccatcta 
aggttctgtg 
tctaaatgtg 
tgtagcgctc 
tcaataaaag 
aaaaagtgcg 
gtggatcact 
gatactcgtc 
gtatccggtt 
ccggcgctga 
ccaccggcaa 
gcacgggctt 
cgggcgcggg 
tgctggcagt 
tctgggtttt 
ccgagttcag 
ggtctacggc 
gtttcacacc 
cgtaatcctg 
gtcggttggc 
ccagcagcca 
tgctggaggc 
tgcgccccgt 
tggggttata 
cgtccaggcg 
taaatccgcg 
ggccatcgac 
caggcagatc 



tcatcgccac 
ctttgcccgt 
cggcaactat 
caagactcat 
ccccggacct 
gtgcgtcttg 
aaggcatcct 
gggtcaagcg 
gctgcagact 
cgcccgctgt 
tggtcgaaca 
agcacgttga 
agtggccgct 
agacgctcga 
atgctgtcga 
cgagcccggc 
agcgactcgg 
tcgtgaatgc 
tccatatcca 
ccccaacact 
gtggcaagcg 
cgggaaatga 
aagacgaccc 
caatgcgaac 
ctccacacct 
ccccaaaatc 
agacttttcc 
caaaacaaga 
cgagaaaaca 
cagcttagtt 
gcaaagcggt 
gcattcggca 
aggaatcgcg 
aaccaggcgg 
tgtaatcttt 
tacatcatat 
aagttgtatt 
cttttcaggg 
ggcaaaaacc 
gtcatcacgc 
tgtaagcgga 
ttcagctgca 
cggctgagcc 
gggcagcggc 
cagacgctca 
caccggggac 
aggtatcagc 
ccgcaacaac 
atggttaagc 
gtaggcattg 
ggtttccttg 
ctcaatcgcc 
gtcattattg 
caccacgtca 
ggatggaatg 
agagctttcg 
ctcgacaata 
attgctcagc 
cagcctgcag 
ggagattttg 



caggcgccct 
gccggtctca 
ccccagattc 
cccatgaccc 
cgcagagtat 
gcaactccaa 
ggatgagcat 
cttccacagg 
tcaattcgcg 
ccagcgcagg 
atggcaggat 
gccgaaaaaa 
gggccgaggc 
tacctcgact 
tttcatccag 
atacgccggt 
gaatggcggc 
ggttggccag 
gaaacgcgct 
ctcggacgtc 
gagtattaat 
gaaaaagatc 
taccgataga 
aagcgcttac 
caaagcagct 
gcgcaatgaa 
aaaacaacta 
aaaccaaacg 
aaaaacagcc 
ccgccccacg 
catagccccg 
atccgaacaa 
gaaaggattc 
ttaatacgat 
cacttacatt 
ggctaagcgc 
gaggcagatc 
gttttcaata 
acgacgcgcg 
ttgcgaggcg 
tcaggttcca 
ggagctggct 
gttttgggcg 
ggacgctctg 
gccgcccgtt 
tcaccggtaa 
agacgtcccg 
tggccctcat 
tcgactacgt 
gggttgcgca 
ggtaaattca 
cgaccgacgg 
aactgatcat 
cggcgagcgt 
aattgccaca 
atcatcggca 
aaatgcagat 
aaccagtcgc 
cgctgggcaa 
tctgcagccc 



ttgcgggctg 
cccgatagca 
ggaacccgct 
ccaggacatc 
cggcgctgca 
ctcctgaagc 
tttctcgatg 
gtcggcgccc 
gatgttgccc 
agcgggacgt 
ctgttcacga 
caggtcgcga 
aatgatccgc 
ctccaacaca 
gtacaaggtg 
gaatgcaccg 
gcagttcatg 
tgtgtccttg 
attcattgca 
cttatcgatg 
accacgtctt 
acaaagttga 
ctcaggctct 
agcgtccatg 
gtgatccggg 
acgcaactaa 
aaggggtcac 
gtttttagcg 
tgacactaac 
agcagtcgga 
gtttttcggc 
aagcccacct 
agcgtagctt 
cgatgatcgc 
cggctaaaaa 
catctttagg 
aatattgtcc 
gaacaatcga 
gccccgtagg 
acgaacggtc 
caacaggtgc 
gtaacgcctc 
aaggcaggtt 
caacgcgcac 
tcacaatggc 
tgcgcgcgat 
gtccatcgaa 
ccacacccgc 
caaaatacgg 
caaccattga 
gattccagta 
tgccctcccc 
gcaagcgggt 
cgtaggtcgc 
aacctgccgc 
gcagtgccag 
aagggctggc 
gctggcgagc 
cccgctgcca 
gcgaaccttc 



acatctcatg 
gcacgtcgat 
cctcgtccag 
aacgttggat 
actcccagtt 
accgcgtcga 
atgcgcattt 
agcaagggga 
ggccagtcgt 
ccgaactcgg 
cgtttgcgca 
cggaaaagtc 
agatccaccg 
cgcagcagtt 
ccaccactgg 
ttgaccacac 
cccacaaagg 
ccggtgccgg 
atttgatgac 
cctgtactca 
acaaggcaga 
gaattactat 
tgagatgatt 
cgctggctcg 
acaagagcag 
cttctcgtca 
aagtaaggaa 
gcgagcttaa 
tatttgcact 
tttccgaaca 
accactcagt 
gcttagacta 
aataccggaa 
gtgccatcac 
agttcatcaa 
gtccaaaaaa 
gacaacgaga 
gtaaaaccgg 
cagctcgcgc 
atccctgatg 
ggattgggcg 
aggcgcagtg 
ctcggctaac 
tggacgctca 
tgaaggggtg 
gctggtcgtg 
ggtctttttg 
cgtggccgcg 
cgtgttggcg 
gagcgccaac 
gtccacaggc 
cgcgttatag 
caggtaatcc 
gctttgatgc 
agcggccgga 
ctccagcggc 
ccggacactg 
gatacgctca 
cacgtcctcg 
cttatcatct 



aatccgtcgg 
acccagttgc 
atcatcctca 
aaccacacct 
ccttcatgcg 
aattgtgcct 
gcgtgcgcag 
agccgagtac 
ggctgagcag 
cagcgatacc 
aggctggcaa 
cttgttccac 
ggatgaattc 
tggcctgcag 
aggcctctat 
cgaataactg 
gtcccgacct 
tttccccgca 
ccgctgataa 
tcgttgcact 
aatatattaa 
catattaata 
gctttaatct 
ccccgcaagc 
gcacctttga 
ctactcgaga 
gcagaagaaa 
agaagcgaac 
ttagaacagt 
acacagaggc 
actggcattt 
tttccaggca 
ccgcaggttt 
ctagaatgtt 
aataatcata 
cgggtaacgc 
aaaagcacca 
ggttattggc 
ggaccgctgc 
cggggcaact 
atctctaccg 
ggctgctgag 
tgggccgact 
gccacaggcg 
accagcggga 
agcacgcgat 
cgcaggaatg 
agctgggtca 
accggggtca 
agtctgggca 
agaccacgcc 
gcggccagcg 
atcgccgcct 
agattgaagc 
gagttggcca 
atgttgcgct 
gctcccgtga 
ttcatgcctt 
ccgttataaa 
cccccccaat 



http://www.ncbi.nlm. nih.gov/entrez/query. fcgi?cmd=Retrieve&db=nucleotide&list uids=1332..- 2/4/03 



34081 
34141 
34201 
34261 
34321 
34381 
34441 
34501 
34561 
34621 
34681 
34741 
34801 
34861 
34921 
34981 
35041 
35101 
35161 
35221 
35281 
35341 
35401 
35461 
35521 
35581 
35641 
35701 
35761 
35821 
35881 
35941 
36001 
36061 
36121 
36181 
36241 
36301 
36361 
36421 
36481 
36541 
36601 
36661 
36721 
36781 
36841 
36901 
36961 
37021 
37081 
37141 
37201 
37261 
37321 
37381 
37441 
37501 
37561 
37621 



agaccagccc 
actggcagcc 
gcaagcgtgg 
tgtccaccct 
cacaacgcat 
catgccactc 
aatggacaga 
ttcagccaac 
aacccaggca 
tcggtaaatt 
cgaaaaacgt 
gcgaatccgg 
tgcgtaacca 
aggcagcccc 
gcagcccgtc 
ccactcgcca 
caatgctggc 
acggaccgat 
tgaaagacga 
agggaaagcc 
ttcttgccac 
accagttgct 
ccgtgatccg 
aaatggagcg 
agctcccagg 
agcgtatgcg 
aaataccgga 
cggtttatgc 
tggaagtcga 
gcggcaaaac 
cgccgaaaaa 
gtttgagtgg 
gtagcgcatt 
agctggaagg 
acggcggtgt 
acgagcaaag 
acaatcgcgg 
gtgcgggcct 
gctggaaaga 
cttatgtact 
tggcttttga 
gcaaagagat 
aacgcttcgt 
ccgtcacact 
aacacaacct 
ggcaatcgac 
gagggcagcc 
aagacgccga 
agcgcccggt 
agacctggcg 
gcagcggtgt 
acaaaaacac 
gctaccaggg 
aactggagct 
cgcgcatcac 
tggaaaccct 
agagctatgg 
agctggccaa 
agctcaactt 
aagtggctcc 



cgacaccagc 
cacacacaag 
gctcgatacg 
acgtggcacg 
tgctgaatcc 
atcctgtgaa 
gagattcacc 
cgtcacccct 
acgctcttcg 
gttccagaaa 
caagacgccc 
tttttccaac 
ccccaaccag 
ccgtaaaaac 
gtacctcgat 
attccggtca 
cacagatcct 
tccgccgcgc 
aaaactggtt 
tgacttctcc 
acccaagcaa 
acaggccaac 
tagcagcaac 
tgaagacggc 
caaggcacac 
tgtgcatgag 
aggcctggag 
aaaaagtgac 
agacctgcag 
gacccaggcg 
aaccaaaggc 
cgacaagctg 
cgagggcgat 
cgtgcccctc 
tcacgcgctg 
ctcaaaactg 
cctgaccatg 
ggttggcctg 
cgcaggcata 
caagggcggc 
ccgcaacaca 
cgaaggcctc 
cgccctcgat 
cgacattccc 
gcacgccctc 
aaagctgggg 
ggtaaaggca 
gggcaagggt 
agaagaaaac 
aattccaaaa 
ggagaaatcc 
cgcagaaacg 
tcgcctgggt 
gatccatgag 
cgcactggaa 
gcgcgacgag 
caaggcgaaa 
gccgtcggtg 
caaaagctct 
gtctgctgaa 



cgcggcggac 
gcgcccatag 
gggaaggcga 
ctcgccgtta 
tttcagccgt 
ggaattttca 
gtgcagtcac 
gacgcacgtg 
cactcgttga 
tccaaagcgc 
ccgcctgctt 
agcagcccgc 
gcgagcagct 
ctgcgcgtaa 
tcagacaacc 
cctgacagtc 
gatcagccca 
gagcccatgc 
cgcaactcag 
acgttcaata 
acctacctgg 
gggcactttc 
gaagcactcc 
aacattcaca 
atcgctcaca 
gaccgtctct 
gataccgctt 
gatgccgtgg 
tcattttcag 
atcctactga 
cttgagctcg 
tttatcgctg 
gacccgaaat 
ggaggccaca 
atcaaaaacc 
caaagcggct 
cccccgccac 
agtgaaggac 
aaagatatcg 
aagctgcacg 
gcactggccc 
gacgaccgag 
gaccagaaca 
gggctggaag 
accagtaccg 
gaccagttgc 
cttttcacca 
cttatgcagc 
ggtttgaatg 
accgggctga 
aaaaaagcca 
ccccgctgga 
ctgaaagagg 
tccgggggaa 
gcaaaactgg 
ctggaaaatc 
aaccttaaac 
cgcatgcagt 
ggacatgact 
aaccccacca 



ggtcctgacg 
cgaggactgc 
cggcgggcat 
cggttccctt 
aagtttttcc 
cgtttggtat 
catcgatcca 
ctgcaactga 
gcagtgtcgg 
cgcagcagaa 
caaatgtggc 
aaaataccca 
cgggcgcgca 
ggtttgatct 
cgatgaccga 
acctgcaggg 
gcagctccgg 
tgtggcgcag 
agccacaagg 
cgcccggcct 
cccaccaaag 
tgcacctggc 
ttatagaagg 
tcgacaccgc 
ttaccaatgt 
atcagttcga 
tcaacagcct 
tcgacttgtc 
tcgcgccgga 
ctgacatgag 
acggcggcaa 
acactcaggg 
tgaagctgat 
accgcgtcac 
gtcagggcga 
ggaacctgac 
ccaccgccgc 
gcattcaacg 
atcgcctgca 
cactcaagat 
agaccgcacg 
tgatcaaagc 
agctgaccgc 
gcgatatcaa 
gcgggcttta 
gagcccgctg 
acgacgacaa 
tcaaggcagg 
atgtgcactc 
cgctcagaat 
gcaccagcga 
tgaagaacgt 
tttatgaaac 
ggcctccggc 
ggcctcaagg 
acagctacac 
agcaggacgg 
ttggcaagaa 
tggtcaagga 
aaaagttgct 



cggcggcgaa 
gatttgaaca 
gggcgggaat 
ttgaaaccga 
gatggaaccc 
caggcggcta 
ccggaacacc 
cctgcaggaa 
caagcgggcg 
agctgccacg 
tacgcccaga 
tagggcaccc 
gacgcatgaa 
gccgcaagac 
tgaagaagcg 
ctctgacggt 
cagcaaaatc 
caacggaggc 
cagcattcag 
ggctccattg 
caaagacggc 
gcaagacgac 
aaagaaacca 
cagcggccgc 
gcttctcagt 
cccgataagc 
gtccactggc 
gagcccgttc 
caacagagca 
cccggtgatt 
ggcgcaggcg 
cagactttac 
gcccgagcag 
cggattcatc 
gactcactcc 
caatgcgctg 
tgaccggctc 
ctgggacgca 
acgcggcgcc 
tgcggccgaa 
ctcgacaaaa 
ctttgcaatg 
ccacagtaag 
gagcctgtcg 
ctgcctgccc 
gacgccggtt 
cgtgctcagc 
ccaatggcaa 
gcgcatcaca 
ggacgtcaat 
gttcatccgc 
aggtgaccat 
cgagtcgatg 
acggggtcaa 
cgctacgctg 
cgcgctgatg 
cattctcaac 
gcttgctgat 
gctgcaggat 
cggcacgctg 



tagtccacag 
gcgcgagcca 
gtcctgagcg 
gatcggcgca 
gctggcattg 
tcagcgataa 
ggaagcatca 
agagccgaac 
ctgaaaagcg 
ccgcccaccg 
aacaaagccc 
aagtggattc 
atacacccgg 
cgccttgagc 
gtcgcaaatg 
acgcgcattt 
ggtgattcgg 
cgtttcgagc 
ctggatgcca 
ctcgattcca 
gtgcacgggc 
agctcgctgg 
ccggccgtga 
aaaacccaag 
cacgacggcg 
actcgctgga 
ggcaacggct 
atgccgcacg 
gcgttgctca 
ggcgggctga 
gcggcggtcg 
agtgcggacc 
gcaaactttc 
aacggggacg 
cacgctttag 
gtactgaaca 
aacctcgatc 
acgccagaat 
gacagcaatg 
caccccaaca 
gtcgaaatgg 
gtcagcaaca 
gatcacaaac 
ctggacgaaa 
aaggaagcct 
gcgctgcccg 
gcccagatcg 
aggttcgaac 
ggttcaaaca 
acattcgggc 
gccaacatct 
attcagcatc 
ctgttcaagc 
gacctgaaag 
gtcaaggaac 
tcgatcggtc 
cagcatggcg 
ctgggcacaa 
gccttgactc 
aagcatcaag 
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37681 
37741 
37801 
37861 
37921 
37981 
38041 
38101 
38161 
38221 
38281 
38341 
38401 
38461 
38521 
38581 
38641 
38701 
38761 
38821 
38881 
38941 
39001 
39061 
39121 
39181 
39241 
39301 
39361 
39421 
39481 
39541 
39601 
39661 
39721 
39781 
39841 
39901 
39961 
40021 
40081 
40141 
40201 
40261 
40321 
40381 
40441 
40501 
40561 
40621 
40681 
40741 
40801 
40861 
40921 
40981 
41041 
41101 
41161 
41221 



ggctgaaact 
aggatcatgg 
gcgcgctgct 
aaaagctggc 
acatgggctt 
tcaagtcgtt 
gcaaggacca 
gcgacgacga 
ttcttgccga 
acatactcaa 
cgggaaacgt 
caaataatcc 
gcctgggcgt 
atgttccgga 
cattgcaggt 
tcgacctcac 
gagacccgaa 
ctgcgaacat 
acaagaccga 
ccggcgggca 
cctccgcccc 
tcaatttcag 
agccgataac 
acaacacgac 
gcaagaaacc 
acataccacc 
cggcggtcga 
aaaccagcaa 
gttccgtgcg 
aggacccgaa 
gcgtacggct 
gcaccatgac 
agcgtctggt 
tggtcagcta 
tcgtttatgg 
gaccatcggc 
agagctaata 
gcctgtaaaa 
aacgctggct 
tcgaccgcaa 
agcaaatcgg 
ctgtgctcgg 
acgccgttct 
gcgaaatcgg 
catgccggac 
aagccctctt 
cgtgctgacg 
atctgggtag 
gggataccgc 
cggatgttat 
ccgctttcaa 
aggacaccga 
tgccacttgc 
tctgaatacc 
ccataccttt 
gaattttgcg 
tggcagacac 
tgatctggtg 
aagccatcag 
gcgcgcgttc 



cagccaccag 
cctgagcaaa 
cgaccaggtc 
gacgctgcgt 
taccgataac 
caaaaaagcg 
ggccgagctg 
agtcgggctg 
caaggctaca 
tgccgagcgt 
gagcggcggt 
tgcacgcagt 
ggacgtgacc 
tgaagacatc 
gctgaaaaaa 
ggcaggtgga 
tgccgacccc 
caccgttaac 
actgaaggaa 
gcttcgcgct 
aggccccact 
tgtggaaaac 
gactgaaggt 
caaagcaaaa 
ggatgaggtt 
gcccaaagac 
gcatcgggca 
aaccaacctc 
cgacgcgagc 
acttcgcgcc 
ggaaccgaag 
tcaaagcgac 
ggtattccac 
taacagtgga 
cgcagaccag 
atcgctcaag 
acgaaaacag 
aagcacgcgc 
catcgagtga 
accagcagcc 
gggctcgttc 
gcaattcaat 
gcaaggtcag 
gttgcgaagt 
agcaggcgcc 
caagtgccct 
cgactgcgtt 
gcaactgcaa 
tcaacgtgct 
tcaaagcgtc 
cgccttggct 
acccgtcgct 
gccttcagcc 
cgaaagagcc 
gagcaggtcc 
cccttgcggg 
cgccccgcca 
aacatcgttg 
ctcacctacc 
acttttcacg 



aaagccgaca 
gcgcgcctgg 
gaacagctac 
gatgtgactt 
aaagcgctgg 
gaccatgccg 
gccggaaaat 
cagcgcagct 
gggctctggc 
tgcgagggcg 
ttcggtgccg 
gttgatgtcg 
gccaccgtcg 
gacgcattcg 
gcagtggacc 
actgccgata 
aacagcgatt 
ctgatgacct 
ggcggtaaaa 
cagatcggcg 
cccgcatcac 
aggacggtca 
ctgagcaaat 
ctggcggagc 
attcaggcgc 
aacgacaagc 
tcagccaaca 
tccggcctgt 
gccccgggca 
atgctcaagg 
gactcactgg 
ctctccagca 
accgcgaccc 
gcgaatgtga 
gacaagccga 
gaagcggctg 
taaaaaaagc 
ttcacgtgcc 
ggccagttca 
atgcaagcgc 
cgggcagcgg 
gatcgccgct 
tgacgtgccg 
tttcatcgtc 
tgcagcctgt 
caatgcgtca 
caacacacct 
tgcgcctcgt 
cttctggaac 
tcggtacggt 
tctgcggtaa 
gccagggcca 
tgagcggtca 
ttgctgtaga 
tttttcagat 
tcgacataat 
aaaccggatg 
agcatctggc 
ggatgggtgg 
aacgccttgt 



tacctttggg 
cgctggatct 
cgccgcaaag 
acggcgaaaa 
aaagcggtta 
tcagcgtcaa 
tcaaaagcat 
acggagtgaa 
caacggcagg 
gcgttacgct 
gcaaagacta 
gcaacaaccg 
ccgccagcca 
tcgacgacct 
atgagagcta 
tacgcgccgg 
cgttttctgc 
acaccgatta 
accgcccgcg 
gcagccacac 
aaacagccgc 
aacggatcaa 
tgtcgaaggg 
tggccgaccc 
aactcgacgg 
agtacaaggc 
agcacagcgt 
ccagtgaaag 
atgcgacaag 
agatggaggg 
tcgacaagat 
tgctggagga 
aggctgaaaa 
gcgtcactaa 
ttggttacac 
gcgacttgaa 
gccgcattga 
tgggaaatga 
cgctgcgcgc 
gctacgtcga 
ccaatgcggc 
tcgttgttct 
agctgggcgc 
atagtccttt 
gtccggcgcc 
tcgtcttttg 
tcatccacga 
cccatgtgat 
gtatgtggca 
ccagcatagg 
ccgactggtt 
tgttgcgcaa 
caggcggcag 
acgtggtgcg 
cgctctcggc 
tcgacttcaa 
ccagagctct 
gcacagcctg 
acgaaccctg 
cctgagcgac 



acagcgccgc 
ggtcacactg 
cgacatagag 
cccggtcaag 
cgaatcggtc 
tatgcgcgca 
gctcaagcaa 
cctcaccacc 
tgccaccggt 
gtacctcatt 
ctggccgggc 
cacactgacc 
gcgcgccggg 
gtttgaaggt 
cgaggctcgg 
aataaacctg 
ggtagtgcgc 
ttcgttgacc 
ctttttgaat 
ggcccccaca 
caacaacttg 
gtttcgttac 
ccttggggaa 
tctgaatgca 
gcttgaagaa 
attgcgcgac 
gatggacaac 
catacttacc 
agttgccgaa 
cagtatcggg 
cgatgaaggc 
tcgcaacgag 
cttcacctca 
aacactgggg 
cttcgacggc 
gaaagagggg 
agtggcgctt 
acccgcgcgt 
atagacggac 
agttcagact 
aatgaaagat 
gaccgtcata 
ccagagaatt 
aaggttaaaa 
gggattaacg 
tcggctgctt 
cccgaaccgt 
aggcgttttc 
gagactccct 
tgttgcaccg 
ggtgtacaac 
aatagccccc 
tgccgaggtc 
taccgacggc 
gcggtccggg 
ttgcagcagc 
tgcactcagc 
agaaccaccg 
aaccttcttc 
ttcctcgggc 



gatgccagcg 
aaaagccttg 
ccgttacaaa 
gtggtcacag 




gccacaggca 
ctggagcatg 
ccgttcatca 
aaccgtaact 
agcgaaggtg 
ttttttgacg 
cccaactttc 
gtggtcttca 
cagttgaatc 
cgattcaact 
accgaagacc 
ggcggattcg 
cagaaaaacg 
aacgtgacgg 
ggcacacccg 
ggcggagcgc 
aacgtcgcca 
gcgttcctgg 
cgctacacag 
ctgtttgccg 
ttgaaacgcg 
gcacgctttg 
aaaataatga 
ttcatgcgcc 
acgctggcac 
agcctcaacg 
atgcgcatca 
ccaacaccgt 
cgcatcaact 
gaattgtcac 
ttcgaactga 
ttttattcaa 
cacgtcacaa 
atctccctga 
caacagacgc 
gaccatctca 
aagagcgcat 
gatgaagcgg 
cagcatgaag 
cgggtcaagc 
aagcgcctcg 
atccacggcc 
cgcgactcgt 
gttcggatga 
cccatgcctg 
gtggacagat 
gcactgagcg 
agtgcagaac 
tcgcgcaggt 
gtaaataccg 
gtttgcgaac 
gtctgcccat 
aaggcactgt 
tggttcagca 
gtttttttga 
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41281 
41341 
41401 
41461 
41521 
41581 
41641 
41701 
41761 
41821 
41881 
41941 
42001 
42061 
42121 
42181 
42241 
42301 
42361 
42421 
42481 
42541 
42601 
42661 
42721 
42781 
42841 
42901 
42961 
43021 
43081 
43141 
43201 
43261 
43321 
43381 
43441 
43501 
43561 
43621 
43681 
43741 
43801 
43861 
43921 
43981 
44041 
44101 
44161 
44221 
44281 
44341 
44401 
44461 
44521 
44581 
44641 
44701 
44761 
44821 



ccagctcacc 
cagccttcat 
gaaacatcgg 
ttatcgaagc 
tgtcgctggt 
ccgcagccgc 
cgagatcgct 
cctccggcga 
gcgcctgcct 
gatccagcgc 
ttgccttgct 
ccttgtccgc 
agtgctccag 
ctttatccgg 
cctgaaactg 
gcaaaggctc 
gcgcgccgct 
gtcgagccgc 
ctgactgatc 
tctgggcggg 
cgccgatccg 
tgctggggca 
atgctcgcgc 
ggcagaggct 
aaaggaaaat 
cagcaacgcg 
ggcacggcct 
attggcagat 
gataagggtg 
gaataccagt 
cttgcgttgg 
caaaacggga 
tccacatccc 
gcgtcggtcc 
ggccgcaaca 
caaacacgtt 
gcagcgacac 
aggacgccag 
tcatgtcgtt 
agcctgatag 
gcgggggcgg 
caggcggtgg 
gcacacccac 
agggtggcgt 
ctggctcggt 
aagacaccat 
ccgacaaatc 
ctgaaggcgc 
tgaaagccaa 
aagacctgat 
acagcagtgc 
aaatcgacaa 
agtttgatga 
tggtgaaaag 
tcaaacacgc 
caagtagctt 
acagctcaca 
ccgtgccccc 
tattgcgcaa 
tcgccagttt 



gtgttcgctt 
tggcaccatg 
cttgaccacg 
cttgagcatg 
ggacaaacgc 
gacaatcggc 
gacgacgttg 
cttgacgaaa 
gtcagcgtgc 
actgatgtgc 
tgcacgaccg 
ttttgcatgc 
ttgatcagcg 
tatctgaccc 
cttcagttga 
ggcctgcagc 
gaccggagcc 
gacattcagc 
gggagtcagc 
aacagtatcg 
cgaactgatc 
acaggttggt 
cactctttgg 
cgttcaaggg 
gccgggctgt 
ctctcacggc 
tcgtcgcggt 
tcatagaaac 
tggtactggt 
cttcctgctg 
ttgggcatat 
accggtcgct 
tatcgaacgg 
aattgcccac 
gaccaccacg 
cggcgagcag 
acagaaagac 
caagcccaac 
gctgcagatg 
ccaggctcct 
cggtacaccg 
cggcggtgat 
tgcaacaggt 
aacaccgcaa 
gtcggacacc 
caaggtcggc 
tatgggtaac 
tacgttgaag 
aaacgctcag 
tacggtcaaa 
caaaggtgca 
cttcaaggcc 
catgagcatc 
cgacagtgac 
ctacgataaa 
gaaaaaaggg 
gattgcttac 
ctcttctata 
gctggcgccg 
gacggtctgg 



ttcagctcga 
ttcaggcgtt 
ctgttgaccg 
ttggcgtcgc 
acataaccca 
cctgcacctt 
tccagttgcg 
tcggcgtgca 
agagactcct 
tcatccagcg 
gcgtattcgc 
tggcctaccg 
accgactgag 
ggctgggcga 
tagcgctcag 
agactaccga 
gtcccatgct 
cgcgccgcgc 
ggcggattca 
tgctgctggt 
attggaatct 
ccgtcgagga 
ccaggtcgga 
ccacagagcc 
gggcgcccgc 
cgcgcgcggg 
cctgaaactg 
gttcaggtgt 
cattgttggt 
gcgtgtgcac 
aaaaaaagga 
gcgctttgcc 
acagcgatac 
ttagcgaggt 
ccactcgatt 
aacactcagc 
gtcaacttcg 
gacagccagt 
ctcaccaact 
ttccagaaca 
gatgcgacag 
actccgaccg 
ggcggcagcg 
atcactccgc 
gcaggttcta 
gctggcgaag 
ggagaccagg 
aatgtgaacc 
gaagtcacca 
ggcgagggag 
gacgacaagg 
gacgatttcg 
gagctgaacg 
gatctgaagc 
acccaggcat 
ggtggactcg 
gaccgcatag 
tcagcttcac 
gtatgggtga 
tcggctacgt 



aggggtcagg 
cgttgaggcc 
tctcgtgagc 
tgctggtctc 
agtgtgtcat 
tccagccacc 
tatgtgcggc 
aacctaccag 
tgttgccctg 
acgcgatgct 
caagggcagt 
ttgcgggcga 
caaaaccctt 
atttttccag 
gagacaattt 
tcaacaacgc 
cagccttgaa 
cggcagacga 
tgcctgcagt 
ttacccggct 
cccaggagcc 
gcctgcagtt 
aaacgacttc 
catcagcagc 
gaacatgtga 
caacgcgccc 
cagggtgaag 
ggaaatcagg 
catttcaagg 
actgagtcgc 
acttttaaaa 
actcacttcg 
ggccacttgc 
aacgcagcat 
tttcggcgct 
aagcgatcga 
gcacgcccga 
ccaacatcgc 
ccaataaaaa 
acggcgggct 
gtggcggcgg 
caacaggcgg 
gtggcacacc 
agttggccaa 
ccgagcaagc 
tctttgacgg 
gcgaaaatca 
tgggtgagaa 
ttgacaacgt 
gcgcagcggt 
ttgtccagct 
gcacgatggt 
gcatcgaagc 
tggcaacggg 
cgacccaaca 
tcgagtccac 
gccgaaacgg 
gagccgggcg 
tcgcctcccc 
agcctgtggt 



aataaccgta 
agtcttctgc 
aatgcccgcc 
gggaatcgtg 
tgaagacaag 
caccgtgtta 
gaccgaagca 
ggtggttttg 
ttcggcatct 
gttgctcagg 
ctgactgacg 
agcgtcatgc 
gatcagttgc 
ccgctgctgc 
ctcggccatg 
agcacgcgaa 
ggcctgcaaa 
gctttctgtc 
gactgcattt 
gagtttgacg 
gaaaggctct 
gtggcctgcc 
atcaacaaca 
acacgaccgg 
aagttgatgt 
atgtcaccgt 
tccacttcgc 
ctgagtgcgc 
cctctgagtg 
aggcataggc 
acagtgcaat 
agcaagctca 
tctggtaaac 
gagcatcggc 
aagcggcaag 
cccgagtgca 
cagcaccgtc 
taaattgatc 
gcaggacacc 
cggtacaccg 
cggtgatacg 
tggcggcagc 
cactgcaaca 
ccctaaccgt 
cggcaagatc 
ccacggcgca 
gaagcccatg 
cgaggtcgat 
gcatgcccag 
cactaatctg 
caacgccaac 
tcgcaccaac 
taaccacggc 
caacatcgcc 
caccgagctt 
ccccttttta 
tatttcactt 
ttgacgcagg 
gcccatgtct 
actggatgca 



ttggtatcca 
aaggcggcct 
accatcccga 
tctcgcagct 
aactgcggaa 
cgggcagtga 
aggcgcttgg 
gcgtcgacca 
tgcagagtga 
ccttcgccga 
gcaagcgtcg 
atcagttgaa 
ccgacctcgg 
aagtccgagc 
acttcaaaag 
ctgatcatcg 
agctgtgtgt 
gcgtgtgacc 
gggtgagctg 
ccaccggccc 
cgcgtttggc 
ccatgaatcc 
gcacgccttc 
tctgcgcatt 
ccatcaacgc 
agatcagaac 
tgattttgaa 
agatttcgtt 
cggtgcggac 
atttcagttc 
gagatgccgg 
accccaaaca 
cctggagctg 
atcacacccc 
agtcctcaac 
ctgttgttcg 
cagaatccgc 
agtgcattga 
aatcaggaac 
tcggccgata 
ccaagcgcaa 
ggtggcggcg 
ggcggtggcg 
acctcaggta 
aatgtggtga 
accttcactg 
ttcgagctgg 
ggcatccacg 
aacgtcggtg 
aacatcaaga 
actcacttga 
ggtggcaagc 
aagttcgccc 
atgaccgacg 
tgaatccaga 
ctgtttagct 
ggagaagccg 
ttattgaccg 
ttgacggtct 
gtcgatttca 



http://www.ncbi.nlm. nih.gov/entrez/query .fcgi?cmd=Retrieve&db=nucleotide&list_uids=1332„. 2/4/03 



44881 
44941 
45001 
45061 
45121 
45181 
45241 
45301 
45361 
45421 
45481 
45541 
45601 
45661 
45721 
45781 
45841 
45901 
45961 
46021 
46081 
46141 
46201 
46261 
46321 
46381 
46441 
46501 
46561 
46621 
46681 
46741 
46801 
46861 
46921 
46981 
47041 
47101 
47161 
47221 
47281 
47341 
47401 
47461 
47521 
47581 
47641 
47701 
47761 
47821 
47881 
47941 
48001 
48061 
48121 
48181 
48241 
48301 
48361 
48421 



ccgtgtcctg 
tggtccagcc 
ccttggtcgc 
caaagccacc 
ttaccgcacc 
cgttgagcgc 
cgttttcttc 
cgccacgctg 
cagccagacc 
acgccagtgc 
caacccccgt 
ccatcgcatc 
ggtccatcat 
gggtcagcgg 
actgaagcaa 
ccaatcggtc 
cttcgaagtg 
cgtcaactgc 
catgatctgt 
atcgcacaga 
tcacaggcgg 
aagtttgaac 
atacagtgaa 
tttaaaagga 
ctgcgtacgc 
acgcaaattg 
cgcttagcga 
ttcgagaaaa 
tggcttctgc 
atcgacttcc 
tttcgcaact 
gtcaacactt 
gacctcaaca 
ctgcgcgcgc 
gacgagattc 
gctgcggtta 
ctggaatcga 
acgccactgc 
ctgatgccgc 
gtaggctgca 
aacagtcagc 
tcgatgttct 
tggcgcgtgg 
ggcgcgaggc 
gttgctccgt 
ggctcatgtt 
gacatccctg 
tctggtcttt 
ccgcatagtg 
gaagtgtgtc 
cgatatgctg 
tgtgaaagaa 
gccccagaca 
catcgatagt 
cgtgctcttg 
ccgccatagc 
tagagaccat 
cgggactggg 
cctcttcatg 
acgcatccga 



tatgaacgac 
tgcgaaaacg 
cgggtcggtg 
cgccagggcc 
cattgcgtcg 
cgcacccgag 
gctcagttga 
gtgatcacgc 
acccgccatc 
cggagccaat 
gtccagaacc 
gtggagcctg 
cttggtgccc 
tttgagcgga 
cgaggccatg 
gagcttttcc 
caggcggctg 
agcttggcca 
caggtaatcg 
gctggaggca 
ttgttggacg 
agcgcagtgc 
ctgtcgatca 
tcgacgaagg 
ccactaccaa 
aaattaagcg 
gtaagcgcct 
aatggcggtg 
tctgggacgt 
ccctgatgcc 
cgagtgccta 
cacgcagttt 
accctgtcaa 
acctcaaagg 
agcgcgccag 
tctcgcaagc 
ccatggtcga 
cctaccccta 
tgagcatggt 
tgctgctggc 
accggatccg 
cttcgccaga 
ccaacgcatc 
ttatcgcaag 
gccacgccag 
gctgaagctg 
actgtcctga 
accgggctgc 
cgcgtgctgt 
cagaagcata 
gaagcccatt 
ggcctcatcc 
agcgtgcccg 
tttttcaaat 
ggcaatcttt 
cttgattctg 
cagatccggt 
aagatcagcg 
cccctccaat 
cgaaccgggg 



tcggcttttt 
gctgccgaac 
atatttttcg 
agaccgtttt 
gtcgccatat 
tagctggccg 
gccttgggct 
gactgcacac 
gatacaccaa 
acggtacgta 
tgtcgagcaa 
tccggcgagg 
acctggtcca 
gccggcagcc 
gcaaagggcg 
gccttggcga 
gcgcgcgtct 
tcagccgaat 
gcaatcgcat 
agagacgcgt 
cgtcggttga 
tgaagcgggc 
aacagcgcca 
ctgtgtggtc 
ggactgcgcc 
agctttaagg 
tcttccaaac 
tttcacccga 
ggccgtcacc 
cctcacgttg 
taaccgttgg 
tggccggcag 
agccatactc 
cgacgtcaaa 
ccagagcaac 
ctttgccgcc 
tctgtccaac 
cgtttatttc 
caccaccctg 
aatggaccgc 
catggaagac 
gaggcagccg 
aattggcggt 
tgaaagtctg 
tgcgtaccta 
tctgcctgaa 
tgcagagcca 
aacactgctt 
gctctgccca 
ggtgctgcgt 
accctgggta 
gactgccctt 
tcgccacccg 
agaaatttgc 
cttttggctt 
gtcttgatgt 
agcagggtac 
gcatcgaccg 
gggacaaagg 
gcgagtccgg 



tcaccgcggg 
ctgccaggtt 
tcgccatctc 
gggtcaggct 
ccagtggcag 
atttgattgc 
ctttatcctt 
tgagcaggcg 
ggtccacagc 
cggcgttgcg 
ggcttggcga 
cgctcaggta 
tggcgcccga 
aatcgccctt 
tcgcccgcaa 
aggtgtcggc 
cgatcagcgc 
cggccggcgg 
ttatctcgcg 
cggacgctgt 
tgtgcatgga 
gtgtccggag 
gaaacagcga 
ccggatcggt 
gaaaaatcac 
atggcagcgt 
cagcaaagga 
accgtgacct 
gtggacgtca 
ctttgctcgg 
tgggaagcgc 
gtactgacgc 
tttcaacgtc 
acagcaaaac 
aacttcccca 
ggccagttcg 
tgtcagggcg 
ccacggctgt 
ggctggttca 
atcggtacag 
ctgtgcaaca 
ctgctggctg 
ctgagcaggc 
ctctgggcac 
cgtcgcgctt 
ccacgccaaa 
tcgcatggct 
tgagatcgcg 
gcccttttcc 
tctgcaactt 
gcaatgcatc 
ttgcacggct 
cgcggccata 
tctggtgaaa 
cgatgttcgc 
attgcgtggc 
gcaacgaatg 
acgaaaagga 
cgcccgcctt 
acaatgacga 



atcggttgtc 
ggtcaactga 
ctgcaacttg 
ggacgctgac 
accggccatc 
tttataagcc 
caaaccgagc 
gttgccaaag 
accctgcacg 
cgccgagtac 
gtggcgcttc 
atgcagatca 
cagcgctccg 
gttgatcgca 
cgcgcctgat 
gatggttgcc 
agtgatctgc 
cagtttatgc 
ttgctgatcg 
ccgaaagcta 
aattccctct 
cgactacttg 
aacgtccggt 
tgacggttcc 
cgtcgtttgt 
aagttcacaa 
gtgccgcaat 
acgttggctg 
tgctgataga 
cactgatcgt 
gcaccttgtg 
tgatcgatgg 
atgtggctta 
tcgacgggtt 
atgacatcct 
acagcatccg 
gcatggagcg 
tcagcacgct 
ccccggcgat 
acctgcaagc 
ccatcgaaaa 
acctgaaaag 
agaaaaacag 
catttcgctc 
gaacacatca 
aagaggatca 
atcactcaaa 
atcaaggttt 
aagtgtcatg 
gtttgaatag 
gccctgatag 
ctgacaccaa 
gtcagcagca 
cgggtggaca 
agtcgcgcct 
gccgtcacgt 
aagctggggt 
agagcgcgca 
ttcgggatga 
gggcttatcg 



agcgcggccg 
ctgaccgcgg 
cctacccctg 
accaggcttc 
cgcttgccag 
tcgagccagt 
actaatgcac 
cctgcgttgg 
gcgggtctgg 
gtctgaaccg 
accgaagcgg 
cccgtcgcgc 
gaaatgagcg 
ggctgcatgt 
gtagtcgtcg 
ggggtttccc 
gcattgtgta 
gcagcgaaca 
gagctgacag 
tccgtcgcag 
cgttctacgg 
cgtgaaagca 
cgtccgccgg 
actgaataat 
gttgcagatt 
catggcttgg 
gtctggtcct 
gtcgctgttc 
aggcaaaggc 
gctgatcagc 
gggcgcaatg 
cgaacgggat 
cttgcgtgcc 
actgtcgccc 
caatggctct 
tctgacccgc 
catcgccaac 
gttctgcatc 
ctccacggtg 
cccgttcggc 
gaacctgcaa 
ccccgtaccg 
gttaggggaa 
agttgcagac 
gcaagaaaat 
aaaaaatgca 
aacagaagca 
tccagagcaa 
cccaacttgg 
gcctgctgct 
tcctccagtt 
tttactgata 
aacgctctat 
agctgacagc 
atgctgttgt 
aatgaggcga 
tgtacctgct 
tcgaaaaaga 
aaacgggcga 
tgttgcgtct 



http://www.ncbi.nlm.nih.gov/entrez/query .fcgi?cmd=Retrieve&db=nucleotide&list uids=1332... 2/4/03 



j 



48481 
48541 
48601 
48661 
48721 
48781 
48841 
48901 
48961 
49021 
49081 
49141 
49201 
49261 
49321 
49381 
49441 
49501 
49561 
49621 
49681 
49741 
49801 
49861 
49921 
49981 
50041 
50101 
50161 
50221 
50281 
50341 
50401 
50461 
50521 
50581 
50641 
50701 
50761 
50821 
50881 
50941 
51001 
51061 
51121 
51181 
51241 
51301 
51361 
51421 
51481 
51541 
51601 
51661 
51721 
51781 
51841 
51901 
51961 
52021 



tagcggcaac 
gtcatgaacg 
gtttttccgg 
aatgtgctaa 
tgcacactgc 
gtgcgagcag 
atcgtcagcc 
agcagacact 
atgcctcgat 
attccccgct 
tagggtttgc 
ggcaaaggta 
atttgttggg 
tatcaatgtc 
tctatcccgg 
ccatacttat 
tcaccgcttt 
gattgaacgt 
ttttggagag 
cctcgtcaac 
tgaacaggcg 
tctttccaag 
tatcaccccg 
gaagtacgac 
cctcgatcaa 
gaacctgaca 
atccacagtg 
cagcaaaggc 
attcaagcac 
gttaaaggct 
cgtgagcaag 
cgtcgaggac 
tggagcgtat 
gccctgctcc 
aaggttagga 
ctgaatatcc 
attcagatag 
cgcgtcaccc 
ggcgaagtgc 
gcggccgtcg 
atggggaccg 
cggcgcgagg 
ttcttcgagt 
catgtgggtc 
gagtgccgtg 
gccgactcac 
attatcttgc 
tgtgtgtcct 
acctgtggtg 
acgtcgtttt 
acattgtcat 
taagcccgat 
cactgtcaat 
ccgacatcgc 
cttcagattc 
cgcctcgcct 
gtttggttat 
tcagggcggg 
cggtgcagcc 
gaccctgcgc 



ccctgattgg 
ttcgtggggt 
cttgctcctg 
ttcgcgtcat 
gcaacaacag 
gctccagatt 
tttcgatctg 
cgcttccatt 
tgcgcagcca 
cgtttaacga 
actaatatca 
acgggatggg 
tgacgttaaa 
acttggcggc 
cactcgccaa 
agacgtgccg 
tctgcccgaa 
gctcggttga 
cacaccaggg 
tgcctgaaag 
cacggcgaaa 
gccttgacct 
ggcatgcgga 
catgaggcat 
gcgggtaaga 
aggcaattca 
gtcctgcgct 
tgcaaccagt 
ctgcaagccg 
catgccaaga 
gctgatcggt 
tgaacagcga 
tgcaaggagc 
agcgcctggc 
tgcggctgca 
gcgccactca 
cccagatagt 
aggcgaccga 
agccattgta 
ccatgacggc 
tcttcgatca 
ttggttttcg 
aaccacagaa 
tccgctgggt 
aatcgaatgt 
cgcgaagctg 
catcgctgaa 
gccccagaaa 
cttcaacgac 
ccagcattgt 
cgacgaagtt 
agtgatcggt 
catccagata 
cctaagcctc 
atgtcttcaa 
aaccgactga 
gacaccggta 
ctgggtttga 
ttcggctcac 
ctcctgtcgg 



gcgccagatt 
cagatggaca 
gcgtcgataa 
gaggtgatca 
ttcccttgaa 
cagcgccatt 
tgtgaagatg 
cgcggtcctt 
ctgataaagc 
tgattttcct 
atgcgattct 
cagcgagttt 
acgaaggaat 
tgctggagcc 
ggttgggcgt 
ttttccctcg 
aatcttggcg 
tccatatttt 
attcaaaccc 
ccgcaacgta 
aattcctgcg 
cttgatgcgc 
tcacgcgaaa 
tcacccttgg 
aacctgaagc 
gtaatgatct 
ggcgaaaaca 
gcacaccacc 
agtagaagca 
aagccagcgc 
tgaagctggc 
cgtttacgcg 
ctgttcaaca 
gatacgcagg 
gcattccctg 
attcgtcgcc 
tggccagttc 
cgtacaggtt 
cgtactcatc 
ggatcaggta 
ccggggattt 
ggtcgcgctg 
tgcgctgcga 
gagagtggga 
cctctggcga 
acgctccact 
tgtgtagaac 
tcgaccctgt 
cagcaaatcg 
tttgtagccg 
gcccaactgg 
caggttcatg 
acccgcacag 
acatctatgt 
gtagcactac 
ttttcatctc 
ttatcgncgg 
atgcctacag 
tggccagtgg 
tgctgttcat 



gctggatata 
gccggtaaga 
tcttccagat 
agtccggtct 
tcagggttat 
gccagaatca 
aacaacgaag 
acgttgtggc 
cgatcttttg 
ctgtggttca 
tgtaaaaatc 
ttggtaacgt 
gtatgcttaa 
tgatgattca 
ggggaaccca 
cgttggacac 
gtgatgaccg 
tactgcgaca 
gccttaaaag 
agtaaaattt 
ccgcatgctc 
ttgcgacgta 
ggcctccgat 
cctcgaatcg 
cctctgctac 
ggacttctac 
cgagcaggtc 
agaaccgggt 
catgaaccgt 
taaactggca 
ggcagagtcc 
ccaccggtat 
gctcacttac 
tctttcctgg 
cattttggcg 
cagcagataa 
agagtgaatg 
gagcatcagc 
gtaggtggcg 
atcgacgatg 
gcccagcgga 
gtagcgtttt 
acgtgagttg 
tgtctagaaa 
cctcagacgc 
gccgctttat 
acattttcgg 
ggcgagcagt 
atcttgaaac 
gaaaggctga 
tgccaactac 
gcgcgccctc 
ttttaacaga 
actggcgcga 
agcagcggct 
cgtacttgtg 
cgcattgccc 
cgaagggatg 
ctatatttcc 
cgcgggtgcg 



cataaaccgc 
accgaggctc 
agcgctgcaa 
catccagatc 
agccaagcgc 
aaatgacgtt 
tgtcctgttc 
gttgaccctc 
cctcgacagg 
agacgtgatg 
gactcgtgag 
tgccgttgtt 
aaaatgcctg 
tctggacggc 
taacggaggg 
actgctgctg 
caaatgtcag 
gaagagtgcg 
ctttatatgc 
tgctccgctc 
cacaagtcga 
taaccgtcgt 
acctgccaga 
attctttccg 
tgcctcttgc 
ctgtttcatc 
tggacagaat 
tcgacagtta 
cgcaagaaaa 
ccggcaaaca 
ggtaacgacc 
ggtcaggctg 
ttcgcaaacg 
catcgttgta 
aattcgccaa 
ggcgtcagcc 
cgcggatgca 
ggcagaatgg 
ctggcaggat 
gcgccagact 
tgaatggcct 
atctcgtacg 
ttcaggtggt 
aagactgctg 
gtctgtcggc 
cgattaccga 
aaaaggtgat 
tgaagaccag 
gcaagtcggg 
tcagctcacc 
ggtcattcag 
cttcaggtgc 
gtcataggga 
cgctggtttc 
gacacgcaag 
gcaaccatgg 
ttcatgacgc 
atcacggctt 
gaccgtttcg 
ctgggtacgg 



cctctgtcag 
tttctgggcg 
cgagacggcc 
cgccattgag 
agcgccacct 
gtcctgcggc 
tggcaaccag 
ctgcgcatcg 
ccgcatggaa 
cggttccctt 
tgccgccgat 
gcagggttga 
ctactggtta 
gagcgttgca 
cagagttggc 
ccctacgacc 
ttcagtggag 
gccccgacgc 
gtggcatgca 
ggagtatcag 
ttcaccagag 
agcctgccca 
gcgccgtcca 
gacctccact 
tccaggatca 
taatgacctc 
gcatatgcaa 
agctgatatc 
tacagcaact 
aatccagcta 
cgatcagttc 
ttcattccga 
agtactcacc 
cccaggctgc 
tgaagctcat 
ccagagcttc 
aaggcgcgcc 
ccgaaccttc 
ccggttgcag 
cgataaccac 
tcagctcagg 
gcaggccaag 
ggacaataat 
ggccgccgta 
gcagagcgct 
ccaaacgccg 
gcgccgtccc 
ccgggcagcg 
gataatcctg 
gttgtaatgc 
acaggcgatg 
tcaaagcagt 
actcgtgcgg 
aagcgaagga 
gtcggcaaaa 
gcgcgctcgc 
tgccggccga 
cgctgatcgt 
gacgacgcct 
ccattgcgcc 
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52081 gtccattccg ttcatggtcg ccgcgcgctt cctgctgggt atcgcggtgg gtggcggctc 

52141 ggcgacggtg ccggtgttca ttgccgaaat cgccggcccc tcgcgtcgtg cgcggctggt 

52201 cagccgcaac gaactgatga tcgtcagcgg ccagttgctc gcctatgtgc tcagcgcggt 

52261 catggccgcg ctgctgcaca cgccgggcat ctggcgctat atgctggcga tcgcgatggt 

52321 gccgggggtg ttgctgctga tcggcacctt cttcgtacct ccttcgccgn gctggctggc 

52381 gtccaaaggc cgttttgacg aagctcagga tgtgctggag caactgcgca gcaacaagga 

52441 cgatgcgcan cgtgaagtgg acgaaatgaa agctcatgac gagcaggcgc gcaatcgt 

// 
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